1 
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Stephen Parrot10 

September 30, 2009 
Abstract 

A precise definition of "weak [quantum] measurements" and "weak 
value" (of a quantum observable) is offered, and simple finite dimensional 
examples are given showing that weak values are not unique and there- 
fore probably do not correspond to any physical attribute of the system 
being "weakly" measured, contrary to impressions given by most of the 
literature on weak measurements. 

A possible mathematical error in the seminal paper introducing "weak 
values" is explicitly identified. A mathematically rigorous argument ob- 
tains results similar to, and more general than, the main result of that 
paper and concludes that even in the infinite-dimensional context of that 
paper, weak values are not unique. This implies that the "usual" formula 
for weak values is not universal, but can apply only to specific physical 
situations. 

The paper is written in a more pedagogical and informal style than 
is usual in the research literature in the hope that it might serve as an 
introduction to weak values. 

1 Introduction 

I have just spent several months pondering the implications of a 2 cm stack of 
recent papers concerning "weak measurements" in quantum mechanics. They 
include papers originally introducing this concept, various rederivations and 
extensions, experimental "weak" measurements, and their use to supposedly 
resolve "Hardy's paradox" . 

I have come to the conclusion that the concept of "weak measurement" , as 
presented in this literature, cannot withstand careful scrutiny. That does not 
mean that there can be no truth or value in it, but that generally accepted 
interpretations of its meaning are mistaken. This note explains why. 

It will be written from a semi- historical viewpoint. "Semi" because it seems 
clearest to first introduce the basic ideas in a simple, finite-dimensional con- 
text. Then we shall describe the more complicated infinite-dimensional situ- 
ation in which they were first introduced, resulting in the widely quoted for- 
mulas {■>pf,Aijjtn)/{i>f,^in) or ^{{■ipf,Aipin)/{'>pf,ipin)) for the "weak" value of 
the observable A, given that the pre-measurement state was tpi, and that after 
measurement the state was "post-selected" to be ■;/'/■ (All of these terms will be 
precisely defined later; they are included here only to orient readers who may 
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have some prior acquaintance with weak values.) Then we shall observe that 
though the concept of "weak measurement" may have some valid uses, we see 
no good reason to think that the results of most "weak" measurements will be 
given given by one of these formulas. 

2 Reader's guide to this paper 

The aim of this paper is to share what I have learned in my study of "weak 
values" over the past few months. It is written primarily for physicists. (I am 
a mathematician.) It is hoped that it could serve as an introduction to "weak 
values" . In order to avoid possible misunderstandings, it begins at a level more 
elementary than would typically be assumed by a journal article. However, the 
reader is assumed to be thoroughly familiar with quantum mechanics, partic- 
ularly in finite dimensions at the level of the book Quantum Computation and 
Quantum Information by Nielsen and Chuang [2]. Readers who are already 
familiar with "weak values" can skim or skip the introductory Sections 3 and 4. 

The main ideas of the paper are presented in Section 5, "A simple finite- 
dimensional model" . The main new result is the observation that weak values 
are not unique. I suspect that for many readers, this will be all they care to 
learn about weak values. 

The presentation of that section may seem quite different from most pre- 
sentations in the literature. It would be natural for a reader unfamiliar with 
weak values to wonder if it is oversimplified, particularly since the model is fi- 
nite dimensional, whereas most presentations in the literature are set in a more 
complicated infinite dimensional context. Section 6 discussing the traditional 
approach is written for such readers. It obtains the usual formula for a weak 
value in the traditional infinite dimensional context in what we hope is a math- 
ematically rigorous way. 

So-called "postselection" is fundamental to the traditional approach. We 
have described it in a mathematically precise way but have given no examples. 
That is because despite the mathematical simplicity of the concept, we do not 
know of any physically realistic examples which could be presented using a 
reasonable space and without elaborate diagrams (which we lack the facilities 
to prepare efficiently). The most enlightening examples we have seen are related 
to "Hardy's paradox" . Readable descriptions of that can be found in T3] and 
[5]. 

Section 9 presents another simple proof that "weak values" are not unique. 
Since it is closer to the methods common in the literature (such as the seminal 
paper [3] of Aharonov, Albert, and Vaidman) than the previous exposition, it 
may be more congenial to those already familiar with weak values. As described 
there, it was discovered after the rest of the paper had already been written and 
typeset. Were I writing it again, I might try to find a way to put this new 
proof at center stage, but at this stage, it doesn't seem worth the effort. The 
exposition as originally written still seems more suitable for beginners. 

Appendices 1 and 2 discuss mathematical subtleties which I suspect will be 
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of limited or no interest to most physicist readers. Since I had to work them out 
to be confident that the mathematics was rigorous, I thought I might as well 
include them. 

3 Notation and point of view 

We attempt to stay as close as possible to traditional physics notation, reverting 
to notation more common in mathematics only when it seems less ambiguous 
or complicated. The inner product of vectors v,w in a, complex Hilbert space 
H will be denoted {v,w), with the physics convention that this be linear in the 
second variable w, and conjugate-linear in the first variable v. The norm of 
a vector v will usually be denoted as \v\ := but (with due notice to 

the reader) this may occasionally be changed to ||u|| when dealing with Hilbert 
spaces of functions, to distinguish the norm of a function from its absolute 
value. 

The mathematics of quantum mechanics describes a (pure) state of a physical 
system as a ray in a complex Hilbert space, a "ray" being defined as the set 
of all multiples av, a complex, of a nonzero vector v. Thus a ray is uniquely 
described by any nonzero vector in it, which may be normalized to norm 1 if 
desired. In a context of detailed calculations, normalization is often helpful, 
but otherwise it may merely introduce nuisance numerical factors. We shall 
normalize only where it seems helpful. Also, we follow convention by speaking, 
regarding a vector v, of the (pure) "state" v instead of the more pedantic "ray 
determined by w" . 

The projector to a subspace E will typically be denoted Pe, in place of the 
common but unnecessarily complicated physics notation J2i |ei)(ei|, where {ej} 
is an orthonormal basis for E. When E is the entire Hilbert space of states, 
Pe is called the identity operator and denoted I := Pe- When E is the one- 
dimensional subspace spanned by a vector w, we may write P^ for Pe. We shall 
make constant use of the formula for P^ when \w\ — 1: P^v — {w,v)w. Note, 
however, that under our convention, P^ = Pw/\w\, so this formula for P^ only 
applies for \w\ = 1. 

Our mathematical formulation of quantum mechanics follows that of Chap- 
ter 2 of the book of Nielsen and Chuang [2]. We shall consider only projective 
measurements (as opposed to the more general "positive operator valued" mea- 
surements emphasized there). 

Given a system in a pure state and a subspace E, the projective measurement 
associated with E answers the question: "After the measurement, is the state 
of the system in E or the orthogonal complement E-^ of El" . (According to 
usual interpretations of quantum mechanics, those are the only possibilities for 
such a projective measurement.) 

If before that projective measurement the system was in pure state v, then 
after the measurement it will be either in (unnormalized) state Pev or {I—Pe)v, 
with respective probabilities |PEu|VkP and |(/- Pe)v\'^ /\v\'^ ([2] P- 87 ff.). 
The measurement is sometimes said to "project" the pre-measurement state v 



4 



(which may be neither in E nor its orthogonal complement) into either E or 
its orthogonal complement H When E is the one-dimensional subspace spanned 
by a vector w, the measurement is said to determine if the post-measurement 
state is w, which for normalized states v and w will occur with probability 
l^juf P = \ {i',w)\'^. Note also that when E is the entire pure state space, so that 
Pe = I, the measurement has no effect on the system. 

More generally let Ei,E2, ■ ■ ■ be a finite or infinite collection of pairwise 
orthogonal subspaces with -^Bi = ^- The collection of projectors {-Pe^} 
is called a resolution of the identity. With any such collection is associated a 
projective measurement whose result is to "project" an initial (pre-measurement) 
pure state v onto one of the Pb^v with probability IPg.vp/lwp. 

The above refers to pure states. We shall deal mainly with pure states, but it 
is sometimes difficult to discuss measurements of pure states without introducing 
the slightly subtle concept of "mixed" state. For simplicity of exposition, we 
shall review this concept in a context less general than necessary, but as general 
as we shall need it. 

Let the unit vectors vi,V2t-- represent a (finite or countably infinite) col- 
lection of pure states and pi,p2, - ■ ■ corresponding probabilities which sum to 1. 
Suppose we decide to choose randomly one of the pure states Vi with probability 
Pi and then perform some measurements on the chosen state. Just before the 
random choice, we say that the system is in a mixed state which we'll denote by 
the traditional symbol p. Mathematically, a mixed state p is described by a pos- 
itive operator of trace 1, traditionally called a density matrix: for the above ex- 
ample, p — J2i PiPvi ■ If after the random choice we measure to see if the (pure) 
state of the system lies in a subspace E, the probability that it does lie in E is 
tr (pPe). When all but one of the probabilities pi vanish, say the nonvanishing 
probability is pk = 1, then p = Py^ and tr (pPg) = tr {P^^^Pe) = \PEVk? /\vk\\ 
which is the same as the corresponding formula for a system in pure state Wfeu 

For example, suppose the Hilbert space of pure states is two-dimensional 
with orthonormal basis ei,e2, and consider a measurement to determine if a 
particle with initial normalized state v is (after the measurement) in state ei or 
62- (This corresponds to a projective measurement with respect to the resolution 
of the identity Pei , Pe2 ■) If we know that the measurement has been made but 
have not been told the result, then from our point of view the system is in 
a mixed state: it is in pure state ei with probability jPejUp = |(ei,w)p and 
in pure state 62 with probability |(e2,w)p. But if we know the result of the 
measurement, then the system is in a pure state, either ei or 62, according to 
the result. A subtle aspect of the concept of "mixed state" is that the state 
of the system may depend on the observer: an observer who knows the result 
of a measurement will "see" a pure state, but the state will be mixed from the 
perspective of an observer who knows that a measurement has been made (or 

■^This terminology is suggestive but slightly misleading because to "project" v in this sense 
is not the same as applying a single projector to v. 

^ Since our formulation of "mixed state" does not exclude the case of exactly one nonvan- 
ishing Pi, a pure state is technically a mixed state. Some authors use "mixed state" to denote 
what we would have to call a "non-pure mixed state" . 
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will be made) but does not know the result. 

4 The problem which "weak" measurements ad- 
dress 

Consider the following trial of an experiment which is to be repeated many 
times. A quantum system S is prepared in a given initial state Si„. We have an 
observable (Hermitian operator) A : S ^ S. For simplicity, assume that A has 
a complete set of eigenvectors. We would like to measure the expectation of A 
in the state Si„, namely {sm, Asin), without changing the state of S (from Sm 
to something else). 

What was just proposed may seem impossible because measuring A consti- 
tutes a projective measurement which will project Sm onto one of the eigenvec- 
tors of A (thus changing it, unless Si„ happened to be an eigenvector to start). 
However, we did not say we needed to measure A, but only its expectation in 
the state Sm, which is not quite the same thing. And we are willing to weaken 
the requirement of not changing Sm to only changing Si„ negligibly. 

Why would we want to do this? Well, we might want to perform a subsequent 
experiment on Sm and speak of "the expectation of A in the state Sm given 
that" the subsequent experiment yielded a particular result. Classically, this 
would be called a conditional expectation of A given the particular result of the 
subsequent experiment. 

In practice, the "subsequent experiment" is usually a so-called "postselec- 
tion" to a given final state s/. We measure {sin, Asm) while changing Sm 
neglibibly and then perform a projective measurement to see if the final state 
is s/ (as described in a preceding section). If the final state is sj, we record 
the measurement of {sin, Asin); otherwise we discard it. The average of all the 
recorded measurements {sin,Asin) is taken as (a good approximation to) the 
average value of A in all states Sm which project to s/ in the final step. Classi- 
cally, this would be called the conditional expectation of A given that the final 
measurement yielded s / . 

The reader who feels a sense of unease at this proposal should be congrat- 
ulated on hifl perspicuity. Indeed, serious objections can be raised to the pro- 
posal, but we are presenting it as it seems typically (though usually not so 
explicitly) presented in the research literature. Objections will be discussed 
later. Right now we are trying to present a rapid overview. 

Part of the above proposal is both sound and experimentally feasible. A 
seminal paper of Aharonov, Albert and Vaidman [3] cleverly expands a theory 
of measurement of von Neumann to accomplish measurement of (si„, Asin) while 
negligibly changing the state of the system. 

*or her, of course. We follow the long-standing and sensible grammatical convention that 
either "his" or "her" means "his or her" or "her or his" in contexts like this. We find it 
distracting to alternate "his" with "her", and make no attempt to equalize the number of 
each. 
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They couple the original system S to an auxiliary "meter" system M in 
such a way that a particular "meter" observable in the meter system has the 
same mathematical expectation as A, and in addition the coupling between the 
meter and the original system 5* is so weak that measuring the meter observable 
negligibly affects the state of S. 

This involves a tradeoff between weakness of the coupling and accuracy 
of individual meter measurements. A weaker coupling generally entails wider 
dispersion of the meter measurements. The average of a large number of meter 
measurements will approximate {sm, Asin) , but a very large number may be 
required when the coupling is very weak. 

5 A simple finite-dimensional model 

5.1 Introduction of the model 

Wc present a simple model which illustrates how weak measurements can be 
performed. Let S denote the Hilbert space of the system which is the primary 
object of study, and M the Hilbert space of an auxiliary "meter" system. We 
will take both S and M to be two-dimensional with orthonormal bases so,si 
for S and mo, mi for M . No confusion should result if we use the same symbol 
S for the physical system itself and for its Hilbert space, and similarly for M . 

The Hilbert space of the composite system comprising both S and M is the 
tensor product S(i)M, which has the orthonormal basis {si^rrij | < i, j < IjH 
If the system S is in state s and system M in state m, then the composite system 
is in state s®m, and conversely. Not every state of S®M can be written in the 
form s®Tn. Those which can be written in this way are are called product states, 
and those which cannot are called entangled. An example of a state which can 
be proved to be entangled is sq ® mo + si ® mi. 

5.2 The states of S and of M derived from the state of 

When the composite system SiS)M is in an entangled pure state, each individual 
system S or AI is in mixed state, which is never pure. This follows from the 

^This is a note for the inexperienced reader, and also to bridge tlie gap between the 
traditional physics notation of our primary reference [2] and our slightly different notation 
(which we think simpler for the kind of calculations which we shall be doing). 

One definition of "tensor product" of the above S and M (the simplest but not necessarily 
best definition) is as a 2 X 2 = 4 dimensional Hilbert space with orthonormal basis denoted 
as above and with a binary operation ® which assigns to any s G S and m, £ M an element 
denoted s(S)m of S®M, {s,m) i— > s®m, such that ® satisfies the distributive rules of algebra: 
for any scalars ct; , fij , 

(aoso + uisi) ^ (pomo + ^J.lml) = aofio so (8imo -t-fOMl so^mi+rrifio si ^mo+crifJ.i si (g)mi. 

Typical physics notation for Si (g) nij might be something like \i)s\j)M. In this notation, 
the tensor product symbol is omitted. It could be omitted in our notation, too, but it seems 
to enhance readability. 
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general rule ([2], P- 107) for passing from a mixed state p (which could be pure) 
of the composite system S ® M to states of its factors S and M. The rule is 
that the state of S is obtained by taking the partial trace with respect to M of 
yO, denoted tr^f p. A consequence of this rule is that when (g) M is in a pure 
state, a (projective) measurement in Ad can affect the state of S if and only if 
the state oi S ® M is entangledH 

Appendix 1 reminds the reader of the definition and properties of the partial 
trace. One very useful calculation done there encapsulates most of the properties 
of the partial trace which will be used in our discussion of weak values. It goes 
as follows. Let {rrij} be an orthonormal basis (finite or infinite) for M and r 
a pure state va. S ® M with |r| = 1 It is easy to show that r may be uniquely 
written as 

r ~ ^ Si (g) TTii with Si e S and J2i — 1- (1) 

i 

(The Si need not be orthogonal.) Considered as a mixed state, r is represented 
by the density matrix Pr, and the corresponding mixed state of S is tiM Pr- 
The calculation yields 

trAfP, = 5]|s,|'-P.. , (2) 

i 

which explicitly exhibits the mixed state of S corresponding to the pure state 
r of 5* (g) M as a convex linear combination of pure states of S. This rule for 
passing from a pure state of 5* (g M to a mixed state of S will be illustrated by 
an example which will also illustrate other subtleties. 

Take both S and M to be two-dimensional with orthonormal bases sq^si 
and mo, nil, respectively. Suppose we perform the following experiment a large 
number of times. Start with the composite system 5 Af in the normalized 
entangled state 

e := (so (g) mo + si (g) mi)/\/2. 

Then perform a projective measurement in the meter system to determine if 
it is in state mo or mi. (This corresponds to a projective measurement in the 
composite system with respect to the resolution of the identity / (E) Pmo , I 
Pmi-) The result of the projective measurement will be mo with probability 
|(/ (g) Pmo)ep = 1/2, and in this case the post-measurement state will be the 
product state (/ (g Pmo)e — so(E mo/V^- (The observable / (g Pmg corresponds 
to measuring Pmg in the meter system and nothing in the S system.) The 

®It is immediate that if a pure state of S ^ M is not entangled (i.e., is a product state 
s (3 m), then a measurement in M cannot affect the state s of S. To see that if a pure 
state is entangled, then a measurement in M will change the state of S, let mo, mi, ... be an 
orthonormal basis for M. Then a general normalized pure state r of 5 M can be written 
uniquely as r = ^ . Si Cg) m;, with £ S satisfying ^ , = 1. 

A projective measurement in M with respect to the basis {m^} corresponds to a projective 
measurement in S ^ M with respect to the resolution of the identity {/ ® Pmi }■ This results 
in outcome m^ occuring with probability |sj.p, and when m^ does occur, the resulting state 
is the pure state (/ (g) Pmi,)r = Sfe ® m^.. That is, the measurement changes r into m^ 
with probability |sfcp. In order for this not to be an actual change, r must be a multiple of 
Sk mfc, which says that r is a product state. 
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corresponding state of S expressed as a density matrix is tvM Pso^mo = -Psoi 
which expressed as a vector state is sq. Similarly, it will be found to be uii with 
probability 1/2, in which case the post-measurement state will be Si ® toi, with 
S in state Si. 

This seems to imply that a measurement in the meter system can affect the 
state of S, and indeed it can, according to what seems the generally accepted 
physical interpretation of the mathematics. (This seems the interpretation of 
[2] and is also our interpretation.) But the way in which it affects the state of 
S is subtle and worth further examination. 

In principle, the two systems S and M could be spacelike separated, so 
that, according the principle of relativistic locality, a measurement in M at a 
particular time (in some Lorentz frame) could not affect a measurement in S at 
the same time. That implies that if the S system can be said to have a definite 
state, that state cannot be affected by a particular M measurement. This seems 
to contradict the interpretation which we adopted in the preceding paragraph. 

The contradiction is one manifestation of a pervasive tension between "real- 
ism" and "locality" , two philosophical concepts which are typically only vaguely 
defined in the literatureO It would seem that if we accept that system S must 
have a definite state (one aspect of "realism" ) and that a measurement in a 
spacelike separated M at a particular time can affect that state at that time, 
then the principle of locality cannot hold. 

The most common resolution of this contradiction seems to be to abandon 
"locality" as a general principle (i.e., reject the proposal that measurements in 
the meter system at a particular time cannot influence the S system at the same 
time) and replace it by the weaker statement that measurements in the meter 
system cannot be used to send superluminal messages (i.e., messages which 
travel faster than light) to the S system. 

That removes the contradiction, but leaves one a little uneasy that although 
there seems no obvious way that the mathematics of quantum mechanics could 
facilitate superluminal communication, nevertheless some clever person might 
someday find a way to do it. 

We will accept this resolution because without something like it, we could 
not use the accepted physical interpretations of the mathematics of quantum 
mechanics. The resolution applies as follows to the experiment introduced above 
of measuring for mo or mi in the meter system. 

Perform the experiment a large number of times, and divide the outcomes 
into two classes, those which resulted in toq (Class 0) and those resulting in mi 
(Class 1). Then any measurements made in S for the Class states will have 
identical statistics to measurements made in a single system S (i.e., forgetting 
entirely about M) in state sq, and similarly the Class 1 states will have statistics 
identical to those of system S in state si. The combined classes will have 
statistics identical to those of a mixed state which is in state sq with probability 
1/2 and si with the same probability. 

^Including this literature! We are not trying to prove a theorem here, but to communicate 
a point of view. An attempt at careful definitions (which would not be easy) would only be 
a distraction. 
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Although the S observer may not reahze it, half the time he is measuring 
in state sq, and half the time in si. Considered as a density matrix (trace 
one positive operator) on 5, his state is (1/2)^5^ + {l/2)Pg^ = 1/2 until he is 
informed of the result of the m,eter m,easurement. If the measurement in the 
meter system results, say, in outcome TOq, and if the S observer is informed 
of this fact, then he will condition his future calculations on this fact. When 
so conditioned, his state becomes Pgg (considered as a density matrix) or sq 
(considered as a vector). 

Only in this weak statistical sense, can the existence or nonexistence of 
a measurement in M (and its result) influence the state of S. But for S to 
perceive this influence seems to require "classical" (i.e., not faster than light) 
communication between M and 5. 

5.3 Introduction to "weak" measurements 

We have noted that when (g) M is in a pure state s (g) m, a projective mea- 
surement in M cannot affect the state of S. More generally, the effect of a 
measurement in M on the state of S is expected to be arbitrarily small if the 
pre-measurcmcnt pure state of S (g M is close enough to a product state. To 
make this precise, we would have to commit ourselves to definitions of "effect of 
a measurement in M" and "close enough to a product state" . When the Hilbert 
spaces S and M are finite dimensional, so are the linear spaces in which the 
mixed states reside, and since all norms on a finite dimensional normed linear 
space are equivalent, closeness can be measured in any convenient norm. 

For example, we could measure the distance between two mixed states p, a 
using the trace norm: ||p — cr := •\/tr (p — (t)^. The reader who wants a 
definite definition may use the one just given, though other norms may be 
easier to calculate in specific contexts. 

For pure states in S M, the reader will probably find it easier to think in 
terms of the Hilbert space norm instead of the trace norm. (We didn't formulate 
our definition in terms of the Hilbert space norm because we needed it to apply 
also to mixed states, in order to talk about the "effect of a measurement in M 
on the (generally mixed) state of S".) 

In infinite dimensions, more care would be necessary. These issues seem not 
to have been considered in the literature on weak measurements. We do not pur- 
sue them here because our aim is to explain the concept of weak measurements 
in the context in which they historically arose, in which technical mathematics 
would only be a distraction. 

For the rest of this section, we assume that all Hilbert spaces occurring are 
finite dimensional. This implies that all Hermitian operators occurring can be 
written as a linear combination (with coefficients the eigenvalues of the operator) 
of orthogonal projectors. 

With these preliminary observations out of the way, we can describe the 
idea of weak measurement in a more precise way. Suppose we are given a repro- 
ducible, normalized pure state s of S. By "reproducible" we mean that we have 
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an apparatus that can produce any number of replicas of this stateH Suppose 
we are also given an observable (Hermitian operator) A on S. Our goal is to 
determine the expectation {s,As) of A to arbitrary accuracy by making mea- 
surements only in M which have arbitrarily small effect on the pre-measurement 
state s of S*. We shall call these "weak" measurements H 

We now describe a simple way to do this. Let mo, mi be an orthonormal 
basis for M. Let e > be a real parameter which will measure the strength 
of the effect of a "meter" measurement in the mo, mi basis on the state of S. 
Here we are using the descriptive phrase "meter measurement in the mo, mi 
basis" as shorthand for a projective measurement relative to the resolution of 
the identity {/ (g) Pmo , I <8) Prm } in 5" (g) M. 

First we consider the special case in which A is both Hermitian and unitary 
(as are the Pauli matrices, for example). Let s be a normalized pure state of S, 
and mo, mi an orthonormal basis for M . Start with the pure state of S (E> M 



r := r(e) := s (g) mo vT— + As TOie . (3) 

The assumptions that A is unitary and s is normalized imply that r is normal- 
ized: |r| = 1. 

From the point of view of S, this state is very close to the given state s when 
e is small. More precisely, the partial trace with respect to M of Pr, ti'M Pr, is 
arbitrarily close to Ps for e sufficiently close to 0. 

Let 

^00 Boi 
Bio Bii 



B := 



(4) 



be the matrix with respect to the mo, mi basis of a Hermitian operator B on M. 
Our main conclusion will require the assumption that Boo = 0, but we initially 
include an arbitrary Boo for purposes of later discussion. 
A short calculation reveals that 

(r(e), (/ ® B)r{e)) = Boo{l - e') + 2eVl-e^s, As)5R(Bio) + e^{As, As)Bii . 
Under the assumption Boo = Oi 

hm Me),(/^We)) ^ 2(.,A.)5i(i?,o) . (5) 

This says that when 5Ri?io 7^ 0, measuring the average value of i? in M and 
normalizing appropriately (by dividing by e) will approximate the average value 
of A in state s of up to an inessential constant factor. Moreover, we shall 



* This does not contradict the supposed impossibility of cloning quantum states (under 
certain restricted hypotheses a "no-cloning" theorem can be proved) because we do not require 
to be able to copy any unknown state given to us, only that we have a device which can reliably 
produce some particular state in which we are interested. 

^In the literature, the term "weak measurement" may sometimes refer to experiments which 
involve not only weak measurements in our sense, but also "postselection" to be described 
below. 
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show in a moment that this approximation is obtained with neghgible change 
(for small e) in the state s of S. 

Although we assumed that A was unitary in order to make the algebra sim- 
ple, similar weak measurement protocols can easily be obtained for arbitrary 
Hermitian operators A (on a finite dimensional S). The following are two pos- 
sible approaches. 

1. If P is a projector, then U 2P — / is unitary (and Hermitian), and 
P = (U + l)/2. Since we showed above how to obtain {s,Us) to arbi- 
trary accuracy by measurements in M, the simple algebraic transforma- 
tion {s,Ps) = {s,Us)/2 + 1/2 gives us (s,Ps) with the same (negligible 
for small e) disturbance of the state S. The case of general Hermitian A 
is then obtained by writing A as a linear combination of projectors. 

This approach has the advantage of conceptual simplicity, but may be 
inconvenient in specific applications. 

2. The case of general Hermitian A can also be obtained by normalizing ([5]), 
i.e., replacing r(e) in ([3]) by the normalized state 

_ r(e) s (g) toqVI - e'^ + As® mie 

■= I / M = , . (6) 

\r{e)\ ^1 - + e^\As\^ ^ ' 

Though the algebra is slightly messier, (O with f(e) in place of r(e) is still 
obtained: 

lunMlilMm . 2(.,A.)5J(B,o) . (7) 

Taking Sio := 1/2 Bqi, we see that measuring the average value of B 
relative to e will yield the average value of A in state s to arbitrary accuracy. If 
the measurement of B has arbitrarily small effect on the state of 5* for sufficiently 
small e, then B "weakly" measures {s,As). (This is essentially our definition of 
"weakly measures", which will be formalized in Definition 1 below.) 

It is not hard to see that for small e, measuring B does not appreciably 
affect the state of S. For small e, f (e) is arbitrarily close to siS)m, and the corre- 
sponding mixed state of S, namely tiM Pf, is arbitrarily close to trjv/ Psi^m = Ps 
(which is the density matrix equivalent of vector state s). Here we have used the 
facts that in finite dimensions, r i— > P,. (|r| — 1) and p i— > trM p are continuous 
relative to the topologies induced by any norms. 

Let bo, bi be orthonormal eigenvectors of B. The measurement of B will 
project f(e) onto either (/(g)Pb„)f(e) or (J^Pfc^ )r(e). For small e, this projection 
will be arbitrarily close to either (/ (g) Pfco)(s (g to) = s (g) Pb^TO or s (g) Pb^m, 
respectively. Taking the first alternative for convenience, we may conclude that 
the post-measurement state of S, namely the density matrix ti m P(i®Ph^)ri is 
arbitrarily close to trjv/ Pii0Ptg)is0m) = trM Ps^Pbg-m = Ps- This shows that for 
small e, measuring B has arbitrarily small effect on the state of S. 

Measuring the average value of A without significantly changing the state of 
S may may seem too good to be true, but close scrutiny reveals that a tradeoff 
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is involved. Measuring the average value of B requires measuring a very small 
quantity (on the order of e). To obtain a reliable average value for B we will 
need to average a large number of individual observations. The smaller e, the 
larger the number of observations required. (From general statistical theory, we 
expect that the required number of observations should scale like 1/e^.) 

In an ideal situation in which measuring B produces only random errors, the 
random errors will average to zero, and the procedure described will be feasible. 
But systematic errors need not average out and can make the weak measurement 
infeasible when e is too small. This drawback, or tradeoff, is implicit in all "weak 
measurement" schemes which we have seen. 

For example, in a Stern-Gerlach experiment to measure the spin of particle, 
the slightest misalignment of the magnets can produce a systematic error which 
will cause the limit e — > in ([5]) to fail to exist. Mathematically, this shows up 
in the necessity of the assumption Bqq — 0. In a physical experiment, there is 
generally no way to assure that Bqo will be exactly zero. But if Bqq is small, we 
can hope to obtain meaningful experimental results consistent with ([5]) when e 
is not too small. 

Since we shall often refer to "weak measurements" , we offer the following 
physical definition of such measurements. 

Definition 1 Let S be a quantum system and A an observable on S. Let s 
be a state of S of which we can make an arbitrary number of copies. A weak 
measurement protocol is a procedure which can determine the expectation (s, As) 
of A in the state s to arbitrary accuracy while perturbing each copy of s used in 
the procedure by an arbitrarily small amount. 

Weak measurements are of particular interest when we want to use the 
(copies of the) state s used in the weak measurement for future experiments. If 
we had measured observable A in state s in the normal way, it would project s 
onto one of the eigenvectors of A, thus changing s to that eigenvector. 

A kind of "future experiment" which has received great attention in the 
literature is "postselection" . Indeed, interest in weak measurements stems from 
the seminal paper of Aharonov, Albert, and Vaidman [3] which considers weak 
measurements in the context of postselection and coins the term "weak value of 
a quantum variable" . 

Let / be a desired "final state" in S'0 A "postselection" experiment fol- 
lows a weak measurement protocol by a projective measurement relative to the 
resolution of the identity Pf (g) Im, {Is ^ Pf)^ Im), where the subscripts on the 

10 Previously, the initial state was called Si„ and the final state Sf to correspond more 
closely to the notation ipi„ and commonly used in the literature. But both notations are 
unnecessarily complicated. The only additional information contained in sj as opposed to / 
is an implicit suggestion that it probably refers to a state in S. But since the final state / 
always occurs in expressions like f ® m a S ® M which explicitly imply the suggestion, this 
additional information is redundant and complicates the typesetting. 

While on the subject, we also warn the reader that earlier versions of this paper used q instead 
oi f or Sf for the final state. Should the reader encounter a g in the context of a final state, 
it is probably a typo. 
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various identity operators denote the spaces on which they act. (In the future, 
we shall generally omit the subscripts, writing, for example, Pf ^ I, {I — Pf) ^ I 
for the above.) 

Suppose the state of 5 (8) M after the weak measurement protocol is a pure 
state (an atypical situation, but the easiest to discuss). Then informally, the 
postselection asks (and answers) the question: "Is the state of S ^ M after 
postselection in [/] (g) M or [/]^ ® M, where [/] denotes the subspace spanned 
by /, and [f]^ its orthogonal complement. If the postselected state of S ^ M 
is in [/] (g) M, then it is of the form / (g) r with r e M, and the corresponding 
state of S is /. Otherwise, the state of S" is a mixture of pure states which are 
orthogonal to /. 

"Postselection" discards cases in which the postselected state of S is not the 
pure state /. This amounts to replacing probabilities by probabilities condi- 
tional on successful (i.e., the postselected state of S is /) postselection. 

In the more typical case in which the state oi S ^ M after the weak mea- 
surement protocol is a mixed state p (a density matrix), then after the posts- 
election it wiU be either in state (P/ (g) I)p(Pf ® /)/tr ((P/ ® I)p{Pf (g /)) or 
((/ - Pf) ® I)p{{I - Pf) ® /)/tr (((/ - Pf) ® I)p{{I - P/) ® /))), so the postse- 
lection may be regarded as resolving the question: Does the postselected state 
a satisfy ((J — Pf) ® I)a = (successful postselection) or {Pf (g) I)a = 0? 

The cumbersome language of mixed states can usually be finessed in the 
context of weak measurements in finite dimensions by noting that a sufficiently 
accurate weak measurement protocol starting with initial state s G S leaves S 
in a state close to s. It is routine to show that this implies that the state of 
S(gM is close to (the density matrix analog Ps^r of) a pure product state s(g)r. 
Then one can apply the simpler discussion previously given to the pure state 
s g) r. 

In the literature, the result of measuring the average value of an observable 
in a state s by a procedure which negligibly disturbs the state (such as ([5]) 
above) followed by conditioning on successful postselection to a final state / is 
said to result in a "weak value" of the observable. If in the above example we 
choose Bio '■= 1/2 =: Bqi, we obtain the so-called "weak value" 



The mathematical procedure may be understood as follows. Asking the post- 
selection question "Is S in state / or in a mixture of states orthogonal to /?" 
is the same as measuring Pf g) /, and the expectation of P/ (g / in state f(e) 
is the proportion of "yes" answers (in the limit of a large number of measure- 
ments). Measuring Pf ^ B — {Pf g) /)(/ g) B) corrsponds to simultaneously 
measuring B and postselecting to /; its average is the average of the observable 



^.^(r{e)^(J^^^B)r(^ 
e^oe{r{e),{Pf^I)r{e)) 



{As,Pfs)/2 + {s,PfAs))/2 
{s,Pfs) 

{f,s){AsJ)/2+{sJ){f,As)/2 




(8) 
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whose value is the value of B for successful postselections and zero otherwise. 
The conditional expectation of B given successful postselection is the average 
of the just mentioned observable Pf(S)B divided by the probability of successful 
postselection. This conditional expectation is 0(e), so we normalize by dividing 
hyeE 

The term "weak value" is not used consistently in the literature. The sem- 
inal paper [3] of Aharonov, Albert, and Vaidman introduced the concept of 
"weak value of a quantum variable" in a way conceptually similar to the above, 
but using a very different "meter system" along with a very different measure- 
ment procedure (which will be discussed in the next section). They identified 
{f,As)/{f,s) (a quantity which in general need not be real) as their "weak 
value". Other authors (e.g., [10]) obtain the real part of this as their "weak 
value" (as we did above, using different methods). Some authors seem to de- 
fine the "weak value" of an observable A to be one of these expressions. Most 
of the literature gives the impression that any "weak measurement" (in the 
sense of Definition 1) followed by postselection will result in a "weak value" 
^{{f,As)/{f,s)), no matter what the measurement procedures. One of the 
purposes of this note is to dispel this belief; a counterexample will be given 
below. 

At this point we have to warn the reader that in order to continue with 
the description of the weak measurement process as typically presented in the 
literature (e.g. as in [3]), we need to temporarily use language which we think 
questionable, as will be discussed in detail later. Suppose we start with a state 
s E S, construct f(e) S SiS)M for suitably small e, measure B in M (i.e., measure 
/ (g) B in (g) M), and then postselect to f £ S. We do this as many times as 
necessary to obtain a reliable average of the results (i.e., measurements of B 
followed by successful postselection). That average is taken as an estimate of the 
expectation of B conditioned on postselection to /, which (after normalization 
by division by e) in turn is used as an estimate of the expectation of A in the 
state s, conditioned on the postselection of /. The emphasized phrase 
is problematic, but to the best of my understanding, it accurately reflects the 
meaning of less precise language typical in the literature. 

Why is the emphasized phrase problematic? After all, conditional expecta- 
tions are uncontroversial in classical probability theory. Insight can be obtained 
by digressing to review classical conditional expectations on finite probability 
spaces. 

Let E be a finite probability space, A a subset of E, and X a (real-valued) 
random variable on E taking on values xi,X2,. ■ ■ ,Xm- Then the expectation 

^^In the narrative, we have been speaking of measuring I B then postselecting to /. We 
are modeling this as measuring I ® B and simultaneously postselecting (which is possible 
because Pf I and I (g) B commute), which is the same as measuring Pf ^ B. The result 
of this simultaneous measurement is the result of the I ® B measurement if the postselection 
was successful, and zero otherwise. 

The narrative could be revised to speak of simultaneous measurement of B and postselection, 
but it seems easier to think about first measuring B, then postselecting immediately after. 
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Exp(X) of X, is defined by 

m 

Exp(X) :^ Xip{X = Xj) , 

i=l 

wliere p denotes probability. Note that Exp(X) is necessarily a convex linear 
combination of the values Xi of X. Conditioning on A means passing to a 
new probability space whose underlying set is A, with new probabilities J'a(') 
obtained by dividing original probabilities by p(A): for any subset K of A, the 
new probability p£^{K) := p{K)/p{A). Relative to this new probability space, 
the new expectation of X, denoted Exp(X | A, ), is 

m 

Exp( X| A) :=^x,pa(^ , 

i=l 

which is again a convex linear combination of the values of X. 

The point is that in ordinary probability theory, conditional expectations 
of an observable (i.e., random variable) are always convex linear combinations 
of the values of the observable. In quantum mechanics, the possible measured 
values of an observable on a finite-dimensional Hilbert space are its eigenvalues. 
But ([5]) need not be a convex linear combination of the eigenvalues of A. So it 
seems questionable to think of ((S]) as a conditional expectation (as seems the 
usual interpretation in the literature). 

This is emphasized by the provocative title of Ahaharonov, Albert, and Vaid- 
man's seminal paper [3j: "How the Result of a Measurement of a Component 
of the Spin of a Spin-1/2 Particle Can Turn Out to be 100". The measurement 
they seem to be talking about is not a single measurement (which certainly 
could be far from expected if experimental error is large), but a measurement 
of the average value of the spin, conditioned by a postselection. The authors 
seem to believe that they have explained how the average postselected value of 
a spin-1/2 measurement can be 100. We are not convinced that the measure- 
ments they describe are measurements of the spin, conditioned on successful 
postselection, as will be elaborated below. 

Although the widely accepted ([5]) surely describes some quantum measure- 
ment, we see no reason that it should correspond to conditioning a measurement 
of the average value of A on postselection to /. What it does correspond to for 
our toy model is given precisely by the left side of equation (j8]) , namely condi- 
tioning the meter measurement on postselection to /. The sentence 

1. "The average value of the normalized (i.e., by division by e) meter 
measurement equals the average value of A" 

is true (in the limit e ^ 0) , but the sentence 

2. "The average value of the normalized meter measurement con- 
ditioned on postselection to / equals the average value of A condi- 
tioned on postselection to /" 



16 



is either false, meaningless, or tautological, depending on how it is interpreted. 

In order to speak meaningfully of "the average value of A conditioned on 
postselection to /" , we need to say how this quantity is measured. If we measure 
A in iS*, successfully postselect to /, and average the results, we do not necessarily 
obtain ©, 3fi((/,As)/(/,s)). 

Physically, this is because measuring A can significantly disturb the original 
state s of the system. To explicitly calculate what happens, suppose that A has 
two eigenvalues ai,a2 with corresponding normalized eigenvectors ai,a2, and 
take \s\ = 1. After A is measured, S is in state with probability |(ai,s)p, 
i = 1,2. Subsequently, the postselection succeeds with (conditional) probability 
|(/, ai)p. The total probability that the postselection succeeds is 

2 

J2\{a^,s)m,a.)\' . 

Hence the conditional expectation of A given that the postselection succeeds is 

a^\{a^,s)m,a,)\^ + a2\{a2,s)m,a2W 
|(ai,s)|2|(/,ai)P + l(a2,s)|2|(/,a2)P ' 

It is easy to see that this need not equal ([8]), di{{f, As) / {f, s)). For example, 
(pl is a convex linear combination of ai, Q!2, and therefore cannot be arbitrarily 
large, whereas As) /(/, s)) can be arbitrarily large when / is nearly orthog- 
onal to s and the numerator (/, As) is not close to 0. (Such examples are easy 
to construct). 

Therefore, if sentence 2 is to be true, the "average value of A conditioned on 
postselection to /" cannot refer to normal measurement in S. To what could 
it refer? If it refers to measurement in M of the normalized average value of B 
postselected to /, then sentence 2 becomes a tautology, true by definition and 
containing no useful information.. 

5.4 "Weak values" are not unique 

We think that the strongest argument that the "weak value" 3?((/, As)/ (/, s, )) 
does not correspond to any simple physical attribute of the system S which is 
being weakly measured is that one can obtain different expressions correspond- 
ing to weak measurements in M followed by postselection to /. These different 
expressions will be obtained by reasoning conceptually identical to the reasoning 
which led to ©. 

Consider the setup described above of a system S coupled to a two-dimensional 
meter system M with orthonormal basis mo, mi. Given a Hermitian operator 
A on S and an initial state s of S, we want to approximate (s. As) to arbitrary 
accuracy by measuring the average value of an observable _B on M. The nota- 
tion will be the same as in the previous discussion leading to the "weak value" 
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Let V be any unitary operator on S satisfying Vs = s. In place of the state 
f(e) of S" (g) M in dll), define 

f(e) (F(g)/)f(e) 



^l-e2+£2|yl5|2 



"Ks (g movT— ? + VAs g) mie 



: g) toqVI — + VAs g) mie 



(10) 



The condition Vs = s guarantees that the corresponding state of 5* will approx- 
imate s for small e. 

Define the operator B on M as in ([4]) with Bqq = and Bqi = 1/2 = Biq. 
Since F g) / and I ® B commute, 

(f, (/ (g B)f) = {{V (g /)f , (/ (g (g = (f , (/ g) « e(s, As), 

and 

limffili^^^^l^ = (.,A.) , (11) 

e^O e 

so averaging measured values of / g> B in state f and normalizing by dividing 
by e is a weak measurement protocol in the sense of Definition [TJ 

As in the discussion leading to ([8|) , suppose that after measuring / g) S with 
S* (g M in state f(e), we postselect the state of S" to / G 5 (which as previously 
explained means a successful measurement of P/ (g / in S* (g M). As before, the 
conditional expectation E^{B\f) of B given success of the postselection is 

E.m) (12) 

(r, [Ff (g l)r) 

(1 - e2)goo(s, Pfs) + 2eVT^jR(gio(.s, P/VAs)) + e^B^jVAs, PfVAs) 
{1- e^){s,Pfs) +e^{VAs,PfVAS) 

Although we are assuming that Pqo = and Biq = 1/2 = Pqi, we included 
these above so that (f, (P/g)/)f) in the denominator could be read off by setting 
P := / in the numerator. Also note that the normalization factor making |f| = 1 
is the same in the numerator and denominator and hence cancels. 

Under our assumption Pqo = 0, the expression (jl2[) for P^ is of order e. As in 
the previous protocol leading to the traditional "weak value" ([8]) , we normalize 
by dividing by e and take the limit as e — > 0, obtaining as the result of our new 
protocol the new "weak value" for {s,As) conditional on postselection to q: 

^^EMn^^{s,PfVAs)^ if^^ (13) 
e {s,Pfs) {f,s) ^ ' 

where the last inequality was obtained by manipulations similar to those leading 
to (I5|),to which this this reduces when V — I . 
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To see that ^ and need not be equal, take S to be two-dimensional 
with orthonormal basis s,s^. Define A and V by the following matrices with 
respect to this basis, 



with 1 77 1 



A 



= 1, 

' s^)/V2- Then the numerator of ([1 



" 


1 


V 


■ 1 





' 





1 


,7 


= s, 






As 


= s 



and As = 



Let 



becomes 



Equation (|13p for this special case becomes 

EeiB\f) _ 



lim 



(14) 



Obviously, this is not independent of 77, as it would have to be if "weak values" 
were unique. Any real number between —1 and 1 can be obtained as a "weak 
value" for the specified s, / and A. 

The title of this subsection, "weak values are not unique" , summarizes its 
conclusion. The reasoning leading this conclusion may be summarized as follows. 
Given an observable A on S and a reproducible pure state s of S, we defined 
a one-parameter family, indexed by I77I = 1, of weak measurement protocols (in 
the sense of Definition 1) to approximate (s. As) while negligibly changing the 
state s of iS*. Applying one of these weak measurement protocols conditional 
on successful postselection of the (negligibly changed) state of to / gives a 
conditional expectation called a "weak value" of A. This "weak value" is not 
independent of the protocol, i.e., not independent of rj. 

Since all these different "weak values" were obtained by conceptually identi- 
cal reasoning, we see no reason to identify any one of them with some intrinsic, 
measurement-independent property of system S. The "standard" weak value 
([5]) is one of these and seems generally identified in the literature (often implic- 
itly) with the expectation of A conditional on successful postselection to /. We 
think such an identification fallacious. 



6 Historical summary and simplification of tra- 
ditional approach 

6.1 Overview 

This section describes an approach similar, but not identical, to that pioneered 
by Aharonov, Albert, and Vaidman [3], which builds on a classical theory of 
measurement due to von Neumann [5]. The cited paper of Aharanov, Albert, 
and Vaidman will be called "AAV" below. It culminates in a formula for weak 
values identical to p3)) except that the real part symbol 3? is omitted. 
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The omission is rather curious. Their footnote 4 seems to recognize that the 
real part should be taken, but the uncontrolled (and questionable) approxima- 
tions in the main text yield (jl3p without the real part. Subsequent literature 
mostly takes the real part, so when we refer to the "usual" weak value for- 
mula, we mean (expressed in the notation of [3] , which is commonly used in the 
literature) 

"usual" weak value of observable A 

with initial state ipm and post-selected to state ipf 

= (15) 

In notation used above and to be used below, this would read 

"usual" weak value of observable A 

with initial state s and post-selected to state / 

= srI/i^ (16) 

All literature on weak values known to me gives the impression that (fTS]) (or 
psp without the real part) is a universal formula which is to be expected in all 
experimental situations. We have seen in the last section that this is not so. 

Most "derivations" (which really should be called "motivations") of (fT5)l in 
the literature are mathematically imprecise and overly complicated. Wc hope 
that a cleaner motivation may help clarify the domain of applicability of (fT5|) . 
Our conclusion will be that (|15p may well hold in some experimental situations, 
but that claims to its universality should be critically examined. 



6.2 Preparation of the initial state for a weak measure- 
ment 

The analysis of the previous section started with initial states called f(e) in ([6]) 
or ■r(e) in (|10p . and it was implicitly assumed that such states are physically 
realizable. By contrast, the literature such as AAV generally starts with an 
initial product state s m ^ S M (product states are generally considered 
physically realizable) and obtains the desired entangled state to measure in the 
meter system by applying a unitary operator U to obtain U{s^m) as the analog 
of our slightly entangled starting state f(e). The unitary operator U is usually 
considered as a time evolution operator U — e~*^* with H the Hamiltonian, 
and its only function is to assure that the initial state U{s <l) m) is physically 
realizable. In our version of the AAV approach, t will be considered as a small 
parameter, which subsequently will be called e. 

Both von Neumann and AAV use a "meter" Hilbert space M defined as the 
space of all square integrable complex-valued functions on the real line R. This 
space is known to physicists as the Hilbert space of a single spinless particle in 
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one dimension and to mathematicians as L^(R), the space of all complex- valued 
functions g = g{q) of a real variable q which are square- integrable: 




\g{q)\'^dq < oo 



The inner product on M = i^(R) is defined as usual for 5, /i e M by 

/oo 
g{qrh{q)dq . 
-00 

Two important Hermitian operators on L^(R) are the position operator Q de- 
fined for g G (R) satisfying certain technical conditions (which we do not list 
here because they will not be important to us) by 

Qg{q) qg{q) for all qeR, (17) 

and the momentum operator P defined by 

{Pg)iq) -i^ for all qeR. (18) 
dq 

For any real a, if we expand e'^"^^ in a formal power series and apply that 
to a smooth function g e _L^(R), using Taylor's theorem we obtain 

(e-*"'^5)(9)=5(g-«) • (19) 

In other words, e~'"^ translates the graph of 17 a distance of a units to the 
right. We shall denote this translate of g hy ga- 

ga{q) ■■= g{q - a) . (20) 

This calculation was purely formal, but it is well known how to formulate defi- 
nitions under which it can be rigorously proved 

In the finite dimensional context considered in the last section, physical re- 
alizablility of the entangled state to be meter measured (f or f of the previous 
sections) is not an issue because any unitary operator on a finite dimensional 
Hilbert space can be physically realized to an arbitrary approximation by phys- 
ically constructible quantum gates ([2], Chapter 4), and any state can obviously 
be obtained by applying a unitary operator to a product state. 

The issue in infinite-dimensional contexts is sidestepped by both von Neu- 
mann and AAV via the assumption that the required Hamiltonian will be phys- 
ically realizable. They both essentially use a Hamiltonian H of the form 

H^A®P , (21) 

where A is the Hermitian operator on S whose expectation (s. As) is to be 
approximated by a meter measurement. 

^^In careful treatments of the foundations of quantum mechanics such as [8j, the momentum 
operator P is defined as the infinitesimal generator of translations, so that {e~^^^ f){q) = 
f{q — (3) is true by definition. Then it is later verified that P = —id/dq on an appropriate 
domain. 

AAV uses a slightly more complicated expression, but in the context of their argument 
it is essentially the same as l|21| l. 
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7 AAV's extension of Von Neumann's theory of 
measurement 

7.1 Our adaptation of Von Neumann's general framework 

Von Neumann [6J did not consider "weak" measurements per se. For the pur- 
poses of dealing with them, we shall modify (|2ip by inserting a small positive 
parameter e to measure the strength of the interaction: 

H{e) -.^ eA® P . (22) 

(Alternatively, e could be considered as a small time; the mathematics is insen- 
sitive to this sort of variation of the physical picture.) Eventually, we shall take 
a limit e ^ after appropriate normalization. 

The operator to be measured in the meter system (the analog of the operator 
B of the previous sections) will be the position operator Q. We will see that 
measuring Q in the state e~'^^'^-'(s (g) m) will yield an average value of e(s. As). 

The setup of AAV is superficially different in that they interchange Q and P, 
so that their "meter" measures the momentum of the "pointer" rather than its 
position. Since Fourier transformation implements a unitary equivalence which 
takes P to —Q and Q to P, there is no essential mathematical difference between 
the two formulations. 

The significance of the Hamiltonian ((22l) is most easily seen in the case in 
which A has only one eigenvalue a, so that A = al, where / is the identity 
operator on an n-dimensional Hilbert space. Let 01,02, ... ,a„ be an orthonor- 
mal basis for S. Then S ® M identifies naturally with a direct sum of n copies 
[tti] ® M ^ M of Af, where [a.^] denotes the one-dimensional subspace of S 
spanned by o^, and = denotes isomorphism. Moreover, each of the copies [oi](g)M 
is invariant under H{e) and may be naturally identified with M = L^(R). When 
so identified, e"'^^'^-' acts as translation by ea: (e^'^^'^^g)(q) — g{q — ea). 

Thus when A has just one eigenvalue a, e~*^^" simply translates the initial 
probability distribution of pointer readings by ea. If the starting meter state 
m yields an average reading 7 = (m, Qm), then the average reading in a state 
g-*^f(e)(g (g) jji^ YfWi ^ _j_ gQ,_ YoY simplicity of language, we choose the origin 
of R so that 7 = 0, i.e., so that the average meter reading in state m is 0. 

We "read the meter" by starting with the reproducible product state s(^m, 
applying e"'^^"^-* , and measuring I (i) Q. Repeating this many times obtains an 
average value for I ® Q oi ea, which for fixed e can be determined to arbitrary 
accuracy by measuring sufficiently many times. Dividing by e gives a. It seems 
reasonable to hope that this measurement might be "weak" in the sense of Defi- 
nition[l]bccause for small e, e^^^'^'^^ {s®m) — s(X)e~*^"'^m = s^m^a is arbitrarily 
close in norm to so ® mo (because translations are strongly continuous). 

That is a hand-waving plausibility argument of the type often accepted in the physics 
literature, not a proof. It is actually wrong in the sense that it cannot easily be made into 
a proof; one sticking point is that the trace and operator norms are not equivalent when M 
is infinite-dimensional. However, the weakness of the measurement can be justified by other 
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The situation is similar if A has several eigenvalues ai, ^2, . . . , ctn (not nec- 
essarily distinct) with a corresponding orthonormal basis ai,a2, ■ ■ ■ ,a„ of eigen- 
vectors: Aitti = aittiji = 1,2, ...,n. In that case, S ^ M decomposes as a 
direct sum ©i([ai] (g) M), where each [a^] (g) M is invariant imder H{e) and has 
a natural indentification with M = L^(R). The action of e"*^'*^-* on [oj] (g) M is 
as described above when A had only one eigenvalue. 

Denote the decomposition of the initial normalized state s € S as a direct 
sum of orthogonal states in [ui] as 

s = ®i Si , (23) 

with Si e [tti] (so that each Sj is a multiple of Oj), and J2i kiP = 1- Then 

(s (g) m, (j4 (g) I){s (g) m)) = ^Q!i|s,|^ 

i 

is the expectation (s, As) of ^ in the state s, which we shall show is also equal 
to 

^e-^g(0 (g g) m), Qe-^^(") (s m)) ^^4) 

Denoting (as always) by the translate of a function g by g^iq) := g{q — P), 
we have 



(e-*^^'^) (s ® m), (/ (g Q)e-*-^('^) (s ® m)) = (©^Sj (g (m),„^ , ®kSk ® Q((rn),a J) 

= ^\s3?{meai-,Q'meoii) 



/oo 
dqm*{q — eaj) qm{q — taj) 
■J 

dqm*{q){q + eaj)m{q) 

-00 



= E 



= e(s,^s) . (25) 

In the second line we used the orthogonality of the the Sj to convert a J2j,k 
into a X] j : a-nd the passage to the next to last line uses the assumption that the 
expectation {m^Qm) of Q in the state m is 0. 

The expectation (s. As) of A can be determined to arbitrary accuracy by 
averaging a sufficiently large number of measurements of pointer position Q for 
small, fixed e, and finally dividing by e: 

, . , _ {e-'"^'>{s g) to), (J 8) Q)e-'^(^>(s g) m)) 

\S,j4s) — . (ZD) 



means under additional hypotheses. Appendix 2 examines the surprisingly delicate issue of 
"weakness" in more detail. 
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The normalized (by dividing by e, which is our definition of "normahzed" in 
this context) conditional expectation of (/ (g) Q) given postselection to / S S* is 

1 je-'"^"^ (s ^ to), {Pf (g> Q)e-'-^<^) {s (g) m)) 

e (e-»-f^(<^)(s(g)m),(P/(g)/)e-^-f^(^)(s(8)TO)) ' ^ ' 

The numerator is the expectation in the state e~^^^'^^ (s ® to) of the product of 
the commuting observables I®Q and Pf ® I. This product has the value of Q if 
the postselection succeeds and otherwise. The denominator is the probability 
that the postselection succeeds. 

Assuming weakness of the measurement, i.e., that after applying e"'^*-*^' to 
s®m and then measuring Q, the state of S will be close to the original state s, 
we have just described a weak measurement protocol (in the sense of Definition 
[1]) for approximating (s. As) to arbitrary accuracy while making an arbitrarily 
small change in the original state s of 5*. The technical problem of proving 
weakness will be examined in Appendix 2. For the moment, we assume it. 

The end of Section 4 pointed out the logical fallacy of identifying ([27)1 with 
the expectation of A in the initial state s conditional on postselection to /, as 
equation (f26|) tempts. If nevertheless we make this identification (as AAV do), 
then we shall show that the limit as e ^ of ([27]) produces a "weak value" 



for A. This coincides with the "weak value" obtained by Lundeen and Steinberg 
[TP] by different methods. By contrast, the "weak value" obtained by AAV was 

W ■ 

Before calculating (P7)) . we continue the historical exposition by indicating 
how AAV obtained (|29p. Instead of assuring weakness of the interaction by 
making the Hamiltonian H{e) close to 0, as we did above, they use the above 
Hamiltonian iJ(e) with e := 1 . To obtain a "weak" interaction which affects the 
state of S only slightly, they vary the initial meter state m, which they assume 
real with a square (which is a probability distribution on R) of Gaussian form: 

2 exp(-gV2(7^) 
('^^ = (2.)V2, • (30) 

They attempt to obtain weakness of the measurement by taking a large, meaning 
that the Gaussian is very spread out. One hopes that this might assure weakness 
because then (to)q,; will be close in norm to to for all of the eigenvalues ai of 
A, so that tracing the state e~*^(s (g) to) over M to obtain the state of S might 
be hoped to yield 

e~^^{s ® to) = ^ Si (g) TOai ~ ^ Si®m ^''^^"^ ^ g. = s. (31) 
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(The above implicitly identifies vector states v € S ® M with the corresponding 
density matrices Pu, in order to apply trM -HI 

Their argument involves detailed calculations with Gaussians employing un- 
controlled approximations After calculating ([27|) . we shall note that the same 
argument (assuming weakness, as do AAV) is easily adapted to produce a math- 
ematically rigorous version of the calculation attempted by AAV, but concluding 
with a different result. 



7.2 Calculation of the AAV-type "weak value" 

Before beginning, we summarize the notation to be used. Since we shall be 
working with a "meter" Hilbert space M := L^(R), which is a function space, 
we change notation for the norm of an element g of Af to 




-,1/2 



to distinguish it from its absolute value 15(9)1. The Hilbert space S of the system 
of primary interest will be assumed finite dimensional, and we continue to denote 
the norm of elements of s as \s\. Also, we denote the norm of u e S"® M by 

We assume given a reproducible product state s^m^S®M = SiS)L'^ (R) 
with |s| = 1 = ||to|| to which we shall apply the unitary operator e~^^'^^^ = 
^-leA^p -(-Q obtain a state on which to perform a meter measurement, which 
means measuring {I^Q). After performing the meter measurement, we postse- 
lect to a given final state f € S. Postselecting to / means measuring P/(g)4l3 and 
discarding results in which this last measurement gives 0. The results which are 
not discarded are then averaged and normalized (by dividing by e) to produce 
a final "weak value" which is often interpreted in the literature (incorrectly, in 
our view) as an approximation (becoming exact as e ^ 0) to the average value 
of A in the state s conditioned on successful postselection to /. 

We assume that S is finite dimensional. Let A : S ^ S he a given Her- 
mitian operator, for which we desire that the average meter measurement will 

^^This argument is highly suspect because the "fs;" is only obvious for the Hilbert space 
norm, and trjvf is not continuous from the Hilbert space norm on 5 M to any norm on 
S. However, Appendix 2 shows how it can be fixed. We include it primarily to provide a 
motivation for the AAV approach which would probably be accepted in the physics literature 
and secondarily to underscore the need for care when making approximations. 

^^A. Peres [9] has characterized equation (3) of AAV as a "faulty approximation". It seems 
surprising that questions about basic mathematical procedures leading to new and striking 
conclusions have remained unresolved in the literature for over 20 years. 

^^More precisely, we model the procedure of measuring Q followed by postselection to / 
as a measurement of Pf Q, the value of which is the value of Q if / is obtained, and 
otherwise. Discarding the results for which / is not obtained then corresponds to division 
of the average value of this measurement by the average value of a measurement of Pf I. 
But it is easier to think and speak of first measuring Q, then Pf. The subtle difficulty in the 
latter way of thinking is that the state of the system after a measurement of Q (but before 
the postselection) is not precisely defined by standard quantum mechanics, as discussed in 
Appendix 2. 
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approximate (s, As). Our goal will be to calculate 

lim ^ i('''"^'Hs'^^)'iPf'»Q)e'""''"Ks^rn)) 

e™ e (e-*-f^(<^)(s(g)m),(P/(g)/)e-*-f^(^)(*«"")) ' ^ 

which corresponds to the procedure just described. The numerator of the second 
fraction is the expectation of the measurement of the product of the commuting 
operators I ® Q and Pf (g) /, which is the expectation of a measurement of 
Q with measurements in which the postselection fails counted as zeros. The 
denominator is the probability that the postselection succeeds, and the entire 
quotient the expectation of Q conditional on success of the postselection. 
The result of the calculation will be the "weak value" 

assuming that (/, s) ^ 0. When (/, s) = 0, the expression is undefined. 

However to obtain this result, we shall need the following additional assump- 
tions on the initial meter state m G (R) . All but one are mild regularity and 
growth conditions. All are satisfied by the Gaussian meter states used by AAV. 
These assumptions are: 

1. We assume that m is a real- valued function. A previous version of this 
paper stated that this was not an essential assumption, but that was 
a mistake. The proof as given in the rest of the section does require 
this assumption. A later section added after the mistake was discovered 
completes the proof to be given below without this assumption, obtaining 
still more non-traditional "weak values" . 

2. We continue to assume, as in the preceding discussion, that {m,Qm) = 0, 
i.e., that the average meter reading is initially "0". 

3. We asume that m satisfies the growth condition 

lim = qni^{q) = 

q— *±oo 



(33) 



4. We assume that m has a continuous second derivative m" = m"{q) satis- 
fying the growth condition 

lim q^\m"{q)\ . 

g— >±oo 



These assumptions can be weakened in various ways which seem not very inter- 
esting. Most are the kind of regularity and growth conditions typically assumed 
without mention in physics calculations. The statements given were chosen for 
their simplicity. 

Actually, we shall perform a more general calculation which will yield a 
"weak value" which is in general nothing like the traditional value ([28| , in order 
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to illustrate that even in the AAV context, "weak values" are not unique. The 
ideas are the same as in the finite dimensional example of nonunique weak values 
in Section 5.40 

We assume given a normalized "initial state" s of S, and a normalized "meter 
state" m of M satisfying the above conditions. As before, ^ is a given Hermitian 
operator on S whose expectation in the state s, {s,As) is to be measured. Let 
oi, 02, . . . , a„ an orthonormal basis of eigenvectors for A, and write 

n 

s — Si, with Si a multiple of a.; and js^p = 1. (34) 

i—l i 

For the more general calculation, we also assume given a unitary operator 
V : S ^ S which satisfies Vs = s. In place of the unitary operator e"'^^*^^ 
which prepared the state for the meter measurement (by applying it to s (g) m) , 
we shall use 

{V (S) /)e-'^(^) ^{V(^ /)e-"-^®^ , (35) 
resulting in an initial state 

{V ® I)e-'"^''> (s m) = ^ Vsj ^ m,„^ (36) 

j 

just before the meter measurement, which will use 

((V (g) I)e-'"(^^'>s (g m, (/ Q){V ® I)e-'"^'-'> s m) 

e 

to approximate {s,As). The "weak value" to be calculated is ([5^ with e~'^(^^ 
replaced by {V ® I)e-'"^'''> : 

lim ^ ^ I)e~'"^'^ js ® m), (P/ ®Q)iV^ J)e-'^(^)(s ^ w)) 
e ((F® J)e-'^('=)(s(g)TO),(P/(g)/)(y(g)/)e-*-f^(<^)(s(g)TO)) ' 

Of course, taking V := I recovers the AAV-type situation previously discussed 
and l|32p . The only reason for the assumption Vs = s is to assure the weakness 
of the measurement, as will be shown in Appendix 2. Since this assumption will 
play no role in the following calculation, we refrain from replacing Vs by s. 
First we calculate the limit as e ^ of the denominator of ([5^ . 



lim((F (g) I)e-'"^^^ (s ® m), (P/ ® I){V ® I)e-'"^^^ (s ® m)) 





hni ((F (g) /) ^ (g) me« . , (P/ (g /) (F (g /) ^ Sj (g m^a . ) 



+0 



That section employed a unitary operator V on S with Vs = s. The choice V := I yielded 
the traditional weak value ?R{{f, As) / {f, s), and other choices yieled other weak values. By 
using a non-real meter function m ^ (R) , one can obtain non-traditional weak values even 
with V := /, as explained in a later section. Thus the reader should feel free to set V .= I on 
first reading. 
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y^CVsi^PfVsj) Km (rriea,, 
J2{Vs.,PfVs,) 



1,3 

= {f,Vs)(VsJ) , (39) 

where we have used the fact that translations are continuous in the Hilbert space 
norm on L2(R-) to eUminate the factor involving the inner product of the m's, 
and obtained the last line by recalling from (p4|) that J2i — ^■ 
Next we calculate 

lim - , (40) 

where T : 5 — > is an arbitrary Hermitian operator. For T := I, this will 
validate (j37p as an approximation to (sjAs), and for T := Pf, it will give the 
numerator of (1381). We have 



{{V (g) I)e-'"^'-^ {s(g)m),{T(g) Q){V (g) I)e-'"^'\s (g) m)) 

= ®I)^Si<g> m,a,, {TV g)Q)Y^ Sj g) mea,) 

i 3 

(tocq., (Qm)eQj.) . (41) 

i,3 

The inner product involving m is 

POO 

(rriea, , Qmec^. ) = / m*{q- eai}qm{q~ eaj)dq 

— — oo 

m{q)(q + eai)m{q — e(aj — a^)) dq 
m{q)eaim{q — e{aj — Ui)) dq 

m{q) q [m{q) - m'{q)e{aj - Ui)] dq + O(e^). (42) 



The second line was obtained by a linear change of variable in the integral. For 
the last line, we performed a power series expansion about q to order 1 with 
remainder and used the growth conditions to estimate the remainder term as 
O(e^). In detail, for any q and (3, 

m{q + e/5)) = m{q) + m' {q)e(3 + m" {Cq){e(3f /2, 

where C,q is between q and q + e/3, and the growth conditions on m and m" 
ensure that the integral of the terms involving m" is O(e^). 

The first integral in the last line of ((42|) is €{ai + o(l)) (because for any (3, 
||me;3 — m|| = o(l), where as usual in this context, o(l) represents a term which 
goes to as e ^ 0. Integrating the second integral by parts yields 

(m,„,,Qme„,) - eN+ ~"' ] +0(e') + eo(l). (43) 



28 



Hence 



e^o e 2 ' ^ 



Combining this with (|¥T|) gives (UOl) as 

hm ^'-^ ^)e"'-^(')s m, (T Q)Fe-*-^('^'^)s to) 

a," + a. 



^ {Vs,,TVAs) ^ ^ (yAs,rysj-) 

« ^ i ^ 

(\/s, T\/As) + (FAs, TVs) 
2 

= 3?(ys,TV^As) . (45) 
To understand these manipulations, recall that the Si were defined in p4p by 

Specializing to T := / gives 
hm = ^{Vs,VAs) = (s,v4s), 

e^O e 

(46) 

justifying ([37]) as an approximation to {s^As). 
Specializing to T := Pj gives 

^.^ ((F ^ /)e--^(^)g ^ TO, (P/ ^ Q)iV ^ /)e-'^(^-^)g to) ^ ^^^^ yAs)(Fs /)) 
c— >o e 

(47) 

Combining this with equations ([39|) and (|45|) yields the "weak value" 

lim '^Q)(^'8'-^)e~''^^"^g'^"^)) 
e™ e{{V (g) /)e-*-f^(^) (s (g) to), (P/ (g) J)(F ig) /)e-*^(^)(s (g to)) 
(/,T/Ag)(Fg,/) + (l^Ag,/)(/,T/g) 
2(/, Vs){VsJ) 

^ ''f^ ■ 

Specializing to V — I yields the AAV- type "weak value" : 

j.^ (e"^(^)(s ^ to), (P/ ^ Q)e-'^(^)(g to)) ^ ^ (/,As) 

«™ e(e-*-f^(<^)(s«)TO),(P/g)/)e-'-f^(^)(sg)TO)) (/, s) ' ^ 

7.3 "Weak values" are not unique, even in an AAV-type 
context 

We have seen that applying {V (g I)er^^^ to a starting product state s (g to to 
obtain a state to be measured and postselected results in a "weak value" given 
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by (BZl), 

^ {f,Vs) {f,s) ' 

where we have finally used the assumption Vs — s, which will be shown in 
Appendix 2 to guarantee "weakness" of the measurement. This does not look 
like the AAV- type "weak value" of equation ([^5]) , namely 

Jf ^^^^^ , (51) 

but for logical completeness, we should check that they can be numerically 
different, as well as different in appearance. The calculational argument given 
at the end of Section 5 is easily adapted to show this, but for the reader's 
convenience and for variety, we give here a more general argument which avoids 
calculation. 

Suppose the two "weak values" given by equations ((50)) and (jSTj) are always 
equal. Then 

for all states s, f G S with (/, s) ^ 0, all Hermitian A: S ^ and all unitary 
V : S ^ S satisfying Vs = s. 

Take A to be any Hermitian operator with As not a scalar multiple of s, i.e., 
s is not an eigenvector of A. Let u := As — {s,As)s denote the component of 
As orthogonal to s. Then for any w € S with \w\ = \u\ and {w,s) = 0, there 
exists a unitary ^ on 5 with Vs = s and Vu = w, i.e., VAs = w + (s, As)s, and 

^{LVA^ = <^ + (. As) 

By varying w, we can change ^{{f,VAs)/{f,s) except in the special case in 
which / is a multiple of s. 



8 Relation of the AAV approach to ours 

As mentioned earlier, our method of obtaining weakness of the measurement 
differs from that of AAV. This section notes that the difference is only cosmetic, 
and it also notes what we suspect may be an essential error in the mathematics 
of AAV. 

We obtain weakness of the meter measurement by replacing the state prepa- 
ration Hamiltonian A^P with H{e) :— eA(E)P. This makes the normalized (i.e., 
divided by e) expectation of the meter measurement conditional on postselection 
to / (cf. dSH)) equal to 

^{e-'"'^''Hs®m),{Pf®Q)e-'"^'\s®m)) 
(e-'-f^(«)(s (g) m), (P/ (g) /)e-*-f^(^)(s (g) to)) ' ^ ' 



30 



By contrast, AAV attempt^^ to obtain weakness by using a fixed preparation 
Hamiltonian Ai^P (multiplied by an inessential constant) and instead replacing 
our fixed meter state m by an e-dependent meter state TO^^y[e](-) defined for 
e > by 



■mAAv[e]{q) 



exp(-e2g2/2)T'/' 



(54) 



V27r/e 

which makes mAAvi^Y ^ Gaussian centered at with variance l/e^. Here we are 
translating AAV into our notation. A reader consulting AAV should remember 
that they interchange P and Q relative to our convention; i.e., their "meter" is 
P, and their preparation Hamiltonian is —A (X) Q instead of our P. 

To see the relation of the two approaches, let to be a fixed meter state (not 
necessarily a Gaussian as in AAV) satisfying the conditions 1 — 4 of Subsection 
7.2, and for e > define a new meter state m[e]{-) S i^(R) by 

■m[e]{q) m{qe)^/e . (55) 

The graph of TO[e](-) is the graph of m expanded by a factor of 1/e and then 
normalized to make its norm || m[e] \ \ equal to 1. For m := TOyiyn/[l] as given 
by (j54p with e :— 1, m[e\ coincides with mAAvi^) ^'^^ arbitrary e > 0. 
We shall show that (l53t can be rewritten as 



(56) 



l(^f,-^H{e) ^ (p^ ^ Q)e-'"'^'^ {S (g) to)) 

^g-i_f/(€) (s (g, to), {Pf (g) /)e"»-f^(<^) (s (g) to)) ' 

{e-i"(^) (s (g) m[e]), (Pf (E) Q)e-'"(^^\s (g) m[e]) 
{e-iH{i) (s (g) rn[t]), (Pf (g) J)e-^«(i) (s (g) TO[e])) ' 



In other words, the result of using the fixed Hamiltonian H{1) with the e- 
dependent meter state TO[e] is the same as using our e-dependent Hamiltonian 
H{e) with our fixed meter state m, and then normalizing by dividing by e. 

We shall show that the numerator of the left side of equals the numerator 
of the right side, leaving the similar calculation of the denominators to the 
reader. The notation will be that of the previous calculation of the AAV-type 
weak value; in particular, s = Si is the decomposition of s as a sum of 
orthogonal eigenvectors Si of A with respective eigenvalues a^, Asi = aiSi, and 
for g e L2(R), gp{q) :— g{q — (i). Recalling that 

e-'"^^Hs®g)^^s,®ga^ , 

i 

we have, setting q' :— eg, 

{e-'-^^^h® TO[e], {Pf (g Q)(s (g TO[e])) 

/OC 



^^We say "attempts" not to cast doubt on the "weakness" of their procedure, but because 
they do not prove weakness, nor even discuss it. 
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^(sj,P/Sj) / m{q' - eai)){q' /e)m{q' - eaj)€dq' /e 



(57) 



which is identical to the the numerator of as calculated in (|¥T|) . (H^ . and 
the following equations. 

Since the AAV-setup seems essentially mathematically identical to ours, one 
may wonder how the two approaches end with different weak values, namely 
(/, As) I (/, s) for AAV and the real part of that for us. Few readers will want 
to slog through the mostly routine but tedious mathematics of both to look for 
errors, so it may be helpful to point out what we think might be an essential 
error in AAV. 

It is well known and routine to calculate that \i g = g{q) is a function in 
L^(R) and g = g(p) denotes its Fourier transform, then multiplying g{q) by e'"', 
with a real, translates its Fourier transform by a. That is, if h{q) := e'^°"'g{q), 
then h{p) = g(j) — a). But this holds only for real a; for nonreal a, e^°"^ grows 
exponentially for large \q\ and h cannot be expected to even have a Fourier 
transform, since it cannot be expected to be in . But in passing from their 
fundamental equation (2) (via their uncontrolled approximation (3)) to their 
(5), AAV seems to assume the just-mentioned fact about Fourier transforms for 
complex a. We suspect that this may be the origin of the difference between 
their "weak value" and ours. 

9 Another way to obtain non-standard "weak 
values" 

We present another way to obtain "weak values" which differ from the usual 
formulas 3fi((/, As) / (/, s)) or (/, As)/{f, s) . It is motivated by a comment on the 
Internet newsgroup sci. physics. research which noted that the unitary operator 
V appearing in the previous examples depends on the initial state s. But what if 
the initial state is not known? The commentor (who posts under the pseudonym 
"student" and whom I thank) thought that it would be a rather peculiar type 
of measurement that would not work for all states. 

We agree that this does complicate the situation, but not fatally. First of all, 
it is frequently the case that the initial state is given by the physical situation; for 
example this is the case for the investigations of "Hardy's paradox" in [13] and 
[5]. The nonuniqueness of weak values for such situations is a counterexample 
to claims that weak values must be given by the above "usual" formulas. 

Second, in all formulations of weak measurements known to us, the initial 
state s has to be assumed to be "reproducible", i.e., the experimenter must have 
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an apparatus which will produce any number of copies of s. This is because for 
very weak coupling between the system of interest S and the meter system, a 
large number of meter measurements may be required to obtain a reliable aver- 
age. And in finite dimensins, a reproducible state is effectively known because 
its components with respect to any given basis can be determined to arbitary 
accuracy by quantum tomography ([5] pp. 389 ff). If we want to make a weak 
measurement involving a starting state s G 5, we can first calculate its compo- 
nents and then construct the desired operator V (to arbitrary accuracy) using 
quantum gates ([2], Chapter 4). 

However, the comment did cause us to look for other ways which did not 
depend explicitly on the initial state to obtain non-traditional weak values. One 
such way was already known from our study of the Yokota, et al. paper [5J (as 
described in the next section), but this seemed undesirably complicated for 
readers without detailed knowledge of that paper. 

The result of the search seemed surprising. We shall present below a way 
to obtain weak values such that the preparation Hamiltonian (the analog of 
i?(e) := eA (S) P of (|22p ) does not depend on the initial state s G S. The 
preparation Hamiltonian used is very similar to the ((22| used by von Neumann 
and AAV except that it requires only a finite dimensional meter space (which 
can be as small as dimension 2), instead of the infinite dimensional meter space 
I/^(R) used by them. Since we continue to assume that the system S of interest 
is finite dimensional, the finite dimensionality of the meter space means that the 
calculations will be purely algebraic, and rigorous. The algebra will be identical 
for an infinite dimensional meter space, though further considerations would be 
necessary to make the calculation rigorous. 

The general method will be identical to that used in Subsection 7.2 to cal- 
culate the AAV-type "weak value" . The difference is that instead of the prepa- 
ration Hamiltonian -ff (e) := eA iSi Q used there, we shall use a preparation 
Hamiltonian of the form 

H{e) ■.^eA(g)G (58) 

with G : M M Hermitian. The Hcrmitian operator on M whose value gives 
the meter reading will be called B : M ^ M instead of the Q used in Subsection 
7.2. Thus G plays the role of P and B the role of Q in Subsection 7.2. 
A starting state s (g) m is assumed given, with \s\ = 1 = \m\ and 

(to, Bm) = (59) 

as before. The state to be meter-measured will be obtained as before by applying 
^-iH{e) g^jn- 
^-i^A^G) [s ® m) ^ s ® m - ieAs ® Gm + 0{e^) . (60) 

Then the normalized (by division by e) average value of the meter measurement 
is 
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_ (s,s){m,Bm) - ie{{s, As){m, BGm) - {As, s){Gm, Bm)) +0(e^) 

e 

= -i{s,As){m,BG-GBni)+0{€) , (61) 

and 

hm 

e^O e 

= -i{jn,{BG -GB)m){s,As). (62) 

Thus if we choose m,B, and G such that 

1 = -i(m, [BG - GB)m) = 23(m, BGm) , (63) 

measuring B in the meter space and normaHzing by dividing by e (for small 
e) constitutes a weak measurement protocol in the sense of Definition 1. (We 
omit the proof that the measurement negligibly affects the state of S for small 
e, which is identical to that given in Subsection 5.3.) 

The average value of the normalized meter reading conditional on successful 
postselection to / e is 

1 (e-'^(-^^'^)(s ® m), (P/ (8) B){e-"'^^'^^\s ® m)) 

-i((s, P/As) (to, SGto) - (As, P/s) (Gm, Pto)) + 0(e) 

(s,P/s)+0(e) 
(/, As) {s, /) (to, PGto) - {As, f) (/, s) (m, GPto) 



(/,As)(m,PGTO) (As,/) (PGto, to) 



0(e) 



(/,s) (s,/) 



0(6) 



(f,As)(m,BGm) 

= 'jj^ - + 0{e) (64) 

when (/, s) 7^ and undefined if (/, s) 0. The following calculations assume 
(/, s) ^ 0. Write 

(to, PGto) = p + iK with p, k real. (65) 
From ((63|) k = 1/2. Then the limit as e ^ of ([64l) becomes 

lim ^ "^)' (^/ P)(e""('^®°Hs "^)) 

e^o e (e-*'^('4®<^)(s(g)TO),(P/(8)/)(e-"('4®<^)(s(8)TO)) 

= 2.5i%# + 2,c,(/'^^) 



(/,5) " /,s) 



(66) 
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The first term in ([^ is the "usual" weak value 3?((/, ^s)/(/, s)), but when 
As){f, s) ^ 0, we shall show that the second term can be chosen arbitrarily 
by adjusting the value of p = 3ff(m, BGm). 

Recall that m,B, and G are arbitrary subject to Q{m, BGm) = 1/2 and 
(to, Bm) = 0. To see that any number can be obtained for ^{m, BGm) under 
these conditions, take the meter space M to be two-dimensional with orthonor- 
mal basis m, m'^, and define G and B by the following matrices with respect to 
this basis: 



G := 



1 

1 



and B := 



p 

p-i/2 



i/2 



with p real. (67) 



Then BG has the following form, where entries denoted have not been 
calculated because they are irrelevant to calculation of (to, BGm) (which is the 
upper left entry of BG): 



BG 



i/2 * 



,BGm) 



(68) 



When 3((/, As) / {f, s)) ^ 0, by varying p, one can obtain any number whatever 
as a "weak value" for A. 

The above calculations are rigorous for finite-dimensional S and 7\f , and still 
algebraically correct in infinite dimensions. In the physics literature, such alge- 
braic calculations are typically accepted as "proofs" . If we relax mathematical 
rigor to this extent, we obtain from the above a very simple "proof" of the 
"usual" weak value 5R((/, As) j (/, s)) by taking 



M 



L2(R), G:=P, B:^ 



m{q) 



1 



V/2 



1/2 



(69) 



where P and Q are the usual momentum and position operators, respectively, 
defined in Subsection 6.2. 

This "proof" is deeply flawed because the starting equation ([501) , 



I TO — ieAs (g) Gm + O(e^), 



would probably be difficult to justify for our unbounded G := P. Some "proofs" 
of the "usual" weak value formula in the literature rely on uncontrolled approx- 
imations like this. In honesty, they should be called something like "algebraic 
motivations" instead of proofs. 

If we are willing to accept uncritically such uncontrolled approximations, we 
can obtain arbitrary weak values in an AAV-type framework by taking B and 
TO as in equation (j69[) and 



This results in 



G : P + SQ with 5^0 real. 



(to, BGm) = ^ + 2 



(70) 
(71) 
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so that when As)/{f, s)) ^ 0, any "weak value" whatever can be obtained 
using the preparation Hamiltonian H{e) := eA(^ {P + SQ). 

Since 6 can be arbitrarily small, anyone who claims that the "usual" weak 
value formula gives the only experimentally possible result should be obligated 
to explain how the von Neumann/AAV Hamiltonian H{e) := eA ^ P can be 
guaranteed in any experimental situation, and assuming that, how the Hamilto- 
nian AiSiP can be experimentally distinguished from A® {P+SQ) for arbitrarily 
small S. On the level of uncontrolled approximations, this will probably be im- 
possible. 

It is almost immediate that 

e'^'^^'/^Pe^'^"'/^ = P + 6Q (72) 

because for any g e ^^(R), {e'''^''^/'^g)(q) = e"*«'*/2^(g) and P := -id/dq. 
(We say "almost" because a rigorous verification would require careful speci- 
fication of the domain of P, which we have not discussed.) In other words, 
the new Hamiltonian Aig) {P + SQ) is formally unitarily equivalent to the AAV 
Hamiltonian A (g) P. 

Thus the above setup with the new Hamiltonian can be unitarily transformed 
into one with the AAV Hamiltonian. The transformation will take the AAV 
meter state m of equation ()69|) into the new meter state 

q ^ e-'^'^/^m{q) , (73) 

whose absolute square defines the same Gaussian probability distribution on 
position space as m. This suggests that it might be possible to rework the 
rigorous calculation of the traditional weak value 3?((/, As)/{f, s)) in Subsection 
7.2 into a rigorous calculation which gives arbitrary weak values in a slightly 
different setup using the AAV Hamiltonian A ^ P, but the slightly different 
meter state (l73l) . The next section will carry this out. 

10 Another way to rigorously obtain non-standard 
"weak values" in a framework similar to AAV 

The rigorous calculation of the traditional weak value ^{f, As) / (/, s)) of Sub- 
section 7.2 assumed that the meter state m S i^(R) is a real- valued function. 
After that proof was typeset, we have noticed that without that assumption, 
the proof of that section is easily adapted to rigorously obtain non-traditional 
(in fact, arbitrary) weak values in a setting very similar to AAV. More precisely, 
it assumes the AAV preparation Hamiltonian with a non-real meter state which 
is the AAV meter state (the square root of a Gaussian) multiplied by a complex 
function of modulus 1, so the new meter state defines the same Gaussian prob- 
ability distribution in position space as the AAV meter state. This section will 
outline the necessary modifications to the argument of subsection 7.2. We con- 
tinue to assume that the meter state m satisfies the normalization and growth 
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conditions 2,3, and 4 of Section 7.2, but we no longer assume condition 1 that 
it be real. 

The new proof is identical to the old through equation (j4ip . written here 
with a new number, and with V := I: 

{e-'"^"^ {s®m),{T® Q)e-'"^'^ (s ® m)) 
= Sj (g) rriea^, (r (g) Q) Sj (g) meg .) 

i 3 

= ^{si,Tsj){mtai,{Q'rn)iaj) ■ (74) 
The inner product involving m is 

/>OC 

{m^a^.Qm^aj) = / m*{q~ eai)qm{q~ eaj)dq 

•J q— — oo 



m*{q){q + eai)m{q - e{aj - ai)) dq 
m* {q)eaim{q — e{aj — a^)) dq 
+ / m*{q)q[m{q)-m'iq)e{aj-ai)]dq + 0{e^).{75) 



The first integral in (I75p is the same as before, namely e{ai + o(l)), but inte- 
gration by parts only determines the real part of the second integral, which is 
the same as before, namely e{ai — aj)/2. 

We shall not evaluate the imaginary part of the second integral in general, 
but only for a specific meter function m which we assume to be of the form 

miq) = e-'^"'/^moiq) , (76) 

where (5 is a real constant and mg a real meter function satisfying the same 
conditions as assumed in Subsection 7.^^ together with the additional normal- 
ization condition 

(mo,Q"mo> = l . (77) 

For example, we could take itiq as the AAV meter function myii^y[l], defined in 
equation (j54p as the square root of a Gaussian with mean and variance 1. 
Then, since 

m'{q)=e-^^"'/'[m',{q)~tSqmo{q)] , 

POO 

{'mea,,Qmeaj) = / - eai)qm{q ~ eaj) dq 

J q— — oc 

= + o(l)) + + 0(e2) 



Those are that mo be normalized, i.e., ||mo|| = 1, and satisfy the previous condition 2 
that (mo,Qrrto), and the growth conditions 3 and 4. 
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+e{aj - ai) / m.a{q)q mo{q)Si dq 



a 



e- 



+ e{aj ~ ai)6i + eo{l). (78) 



This is the same resuh as for a real meter function plus the imaginary part 
(aj — ai)Si. (We assume that the reader will take in stride the use of the same 
symbol i for the imaginary unit and an index as in a^. Since the previous version 
only used real quantities, no ambiguity arose there, and changing notation for 
the index at this point probably risks more confusion than retaining it.) 

The main step to complete the calculation is to evaluate the normalization 
of ([7^ in the limit e — > 0. There are two cases to consider, T / to demon- 
strate that the normalized meter expectation does equal (s. As) and T := Pf to 
calculate the weak value. Here we take T := Pf, leaving the case T :— I to the 
reader. We have 

lim -({e-'"'-'^ {s ® m), {Pf ® Qje^'^^-'^s (g) m)) 

= ^{Si,PfSj){m^a,,{Qm)eaj) 
i,3 



h [aj — aijdi 



= 5R(As, Pfs) - 2SQ{As, Pfs) . (79) 

To understand the passage to the last line, first recall that the Si are eigenvectors 
of A with Asi = aiSi, and s = Si. We shall concentrate on evaluating 

E(sj,P/Sj)(aj - a,) , 

leaving the other term, which is similar and was effectively done in the original 
proof, to the reader. We have 

J2{sr,PfSj)iaj - a,) = ^^(s,;,P/Sj)(aj -aO 

i,j i j 

= Y.[{s,,PfAs)-a,{s,,Pfs)] 

i 

= {s,PfAs)-{As,Pfs) 
= {PfS,As)-{As,Pfs) 
= 2iQ{PfS,As) . 

The desired weak value is 

lim ^ (g ^ m), {Pf ® Q)e-^^(^) js ® m)) 

e (e-*-f^(<^)(s(g)m),(P/(g)/)e-*-f^(^)(s(g)m)) ^ ' 
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The limit as e ^ of the denominator of the second fraction was effectively 
computed in (|39p (which did not require the reality of the meter function) as 
(/, s)(s, /). Combining this with ([7^ gives the weak value as 



This shows rigorously that when As) /(/, s)) 7^ 0, any weak value what- 
ever may be obtained by some postselected weak measurement protocol using 
the AAV preparation Hamiltonian and a meter state different from that used by 
AAV, but which defines the same Gaussian probability distribution on position 
space. 

Therefore, any argument that the "usual" weak value 3?((/, As) / (/, s)) is ex- 
perimentally inevitable will probably have to argue for the experimental univer- 
sality of the precise AAV setup (meter observable, meter state, and preparation 
Hamiltonian). Since there seems no argument in the literature that this precise 
setup can be realized in any experimental situation!^ much less all^ such an 
argument would probably have to break new ground. 

11 Rambling afterword with conclusions 

After spending months learning about weak values in the hope that it might 
lead to some fundamental new understanding of quantum mechanics, I was 
disappointed that this hope did not materialize. This work was written mainly 
to save others in a similar situation from similarly wasting their time. 

I do not recall the exact route I followed through the "weak values" literature, 
but it took me through all of it that I could find. I think the first paper I read 
might have been a very interesting short paper of Yokota, et al. , which describes 
an experimental measurement of weak values to confirm a quantum mechanical 
prediction concerning "Hardy's Paradox" Then I probably looked at the 
mostly clear and interesting analysis of "Hardy's Paradox" in JL3J, followed by 
carefully reading AAV [3^, but it could have been in the reverse order. I have 
made no attempt to reference all the papers in my thick folder on weak values. 

■^^ Since there have been experiments which do obtain the "usual" weak value, that might 
be considered suggestive that the AAV framework can be experimentally realized. However, 
we have seen that other frameworks can produce the same weak value. We have never seen 
an argument in the experimental literature that the experimental situation actually does 
implement the AAV setup. 

■^^ "Hardy's Paradox" is what it is commonly called in the literature, but Hardy | 12| did not 
present it as a paradox, and I think it questionable to call it that. 



lim ^^''"^"^ (s to), {Pf ® Q)e-*-«"(^) (s (g) m)) 
£^0 e(e-*^(')(s (g) to), [Pf (g) /)e-*-f^('^)(s ® to)) 
^{As, Pfs) - 2(53 (As, Pfs) 
{Ls){sJ) 




(81) 
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I was struck by the fact that ah the literature which I saw gave the impres- 
sion that one of the two versions of the "standard" formulas for weak values, 
equation 3?((/, As)/(/, s), or {f,As)/{f,s) was a universal formula 

to be expected in all experimental situations. But AAV's motivation of ([29]) did 
not seem fundamental because it relied on assumption of the particular Hamil- 
tonian (P^ . without any discussion of how to guarantee that Hamiltonian in 
any experimental situation, much less in all experimental situations. 

Yokota, et al. [5] obtained the standard weak values formula ([28|l using a 
Hamiltonian completely different from that of AAV or other motivations for 
(|28|) that I have seen. For example, AAV and Lundeen/Steinberg's [10] both 
use (|22l) , which employs an infinite dimensional meter space, while Yokota, et al. 
use a four-dimensional meter space. It seemed remarkable that these disparate 
approaches yielded the same formula. 

At first, it seemed inconceivable that they could unless they were both re- 
flecting aspects of some fundamental, as yet undiscovered physical reality or 
mathematical fact. I set out to find an explanation for the fact that AAV and 
Yokota, et al., had obtained similar "weak values" with such utterly different 
methods and physical setups. 

The effort led to disappointment. I did not discover any new physical or 
mathematical principles. 

On careful reading of Yokota, et al.^s [5., analysis, I noticed some arbitrary 
elements. Their method for obtaining weak values was not unreasonable, but 
it seemed idiosyncratic, and I would have done it differently. I worked out the 
consequences of my method and did indeed obtain a different "weak value" . My 
method had no claim to being superior to theirs, but the exercise convinced me 
that weak values are not unique. 

I did not present my method for obtaining weak values in the setup of Yokota, 
et al. [5] in this paper because it is more complicated than the nonuniqueness 
examples presented and would require extensive analysis of [5J. The examples 
presented in this paper may seem artificial, but they have the merit of being 
simple. 

The fact that "weak values" different from those of Yokota, et al., ^5] can 
reasonably be obtained for their setup does not imply that there is anything 
wrong with their paper. Their weak values and mine are both obtained using 
quantum mechanics so standard that it is unlikely to be questioned. So long as 
their experimental procedures accurately reflect the mathematical methods used 
to obtain their weak values, their experiment can be viewed as a confirmation 
of standard quantum mechanics. Their modestly written paper does not claim 
anything further. In particular, they (wisely) do not claim to have resolved 
"Hardy's Paradox" . 

The "Reader's Guide" section remarked that I suspect that the introduction 
to weak values and the finite-dimensional examples of Section 5 may turn out 
to be all that most readers care to learn about weak values. That is because 
the ideas seem so unremarkable when presented in that context and because I 
think that the literature which makes them seem remarkable rests on an implicit 
logical fallacy exposed in that section. But I have no illusions that all readers 



40 



will share these views. 

Readers who do think it worth their while to read further will find what I 
hope is a mathematically rigorous derivation of the standard weak value formula. 
To a non-mathematician it may look complicated, but that is mainly because 
everything is precisely defined and much more fully written out than is usual in 
the physics literature. The fact that it applies to almost any real "meter" state, 
not just a Gaussian meter state as in the existing literature, may be viewed as 
an advance by those who take seriously the universality of the Hamiltonian (|22p 
which AAV and most of the existing literature implicitly assume. 

12 Closing remarks on notation 

The notation chosen may irritate some readers. It is intended to stay as close 
as possible to notation common in the physics literature without sacrificing 
mathematical accuracy (as physics notation often does). It necessarily involves 
some compromises. 

Physicists who have never seen a proper introduction to tensor products may 
find notation like s ^ m strange and excessive. Its merit is that it is precise, 
unlike physics- type notation like |s)|to) or |s)s|to)m- (Either would probably 
suggest to many readers that s is a label for an eigenvalue of some operator, 
which it is not. And if it is already understood, that s is merely a label for a 
vector state in S', what additional information would be given by the brackets 
and subscript of |s)5?) 

The use of "projector" in place of the "projection" nearly universal in the 
mathematics literature may grate on some mathematicians. Who does this 
dude think he is to give him the right to substitute his own notation for what 
is standard"!^ 

I chose "projector" mainly because that is the term most common in the 
physics literature. It also seems to me the more grammatically appropriate 
alternative. If a "bettor" is one who bets, shouldn't a "projector" be one who 
projects? If a "bettor" produces a bet, it seems reasonable that a "projector" 
should produce a "projection" . The projector produces the projection [u] ( 
the subspace spanned by u). Mathematicians who find this unconvincing and 
are still offended by "projector" may be mollified by the assurance that I intend 
to revert to "projection" instead of "projector" in my next purely mathematical 
paper! 

Finally, we have bowed to the tradition of the physics literature by describing 
the time evolution of a quantum system initially in state ■)/' by t — > e^^^^ip, where 
H is the Hamiltonian, instead of the simpler and more natural t i— > e*^*V'- 

■^^A referee for my book on relativistic electrodynamics made a similar impassioned objection 
to my choice of "±" for the Hodge dual operation in place of the "*" which most mathematical 
literature uses, and is usually called the "Hodge * (or star) operation". I had chosen "±" 
because it seemed more common in the physics literature after it was popularized by its use 
in the monumental treatise Gravitation of Misner, Thorne, and Wheeler (MTW) and because 
it more accurately reflects the geometric content of the operation. I imagine that the referee 
was a pure mathematician who might never have seen MTW. 
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The complication of the minus sign, though slight in any given instance, is a 
constant annoyance to mathematicians. In the complex number system, —1 has 
two square roots, and "i" is an arbitrary label for one of them. The two square 
roots have identical algebraic properties, so either of them could be called "i" . 
The convention t i-^ e"'^*?/; suggests that "— j" is in some fundamental way 
different from "i", and that it would be wrong to write the time evolution as 
1 1-> e*^*V- 

13 Appendix 1: The partial trace 

The definition of trace class operators on infinite dimensional Hilbert spaces 
is surprisingly subtle, as is the definition of partial trace. If one does not feel 
the need to worry about convergence and orders of summation, the definitions 
can be lifted verbatim from the corresponding definitions for finite matrices. 
However, since we are trying to perform rigorously calculations which are done 
non-rigorously in the physics literature and have led to serious errors, that 
approach would not fulfill our needs. 

On the other hand, we do not want to devote pages to carefully developing 
the properties of traces. We choose a middle approach stating the properties of 
traces which we shall use, either giving references or indicating how they can 
be proved. Good references for partial traces seem particularly hard to find. I 
would be grateful for any that readers might furnish. 

Besides summarizing properties of these traces which we shall need, this 
appendix performs some calculations which will be used in the main text. Our 
primary reference for the definition of trace class operator is A. Knapp's book. 
Advanced Real Analysis |llj . It efficiently develops the properties of trace class 
operators on infinite dimensional Hilbert spaces, but does not discuss the partial 
trace. 

Below the term "operator" will always mean "bounded operator on a Hilbert 
space. A bounded operator T is said to be of trace class if for all orthonormal 
sequences {ui}°^i and {i^ij^i, 

oo 

^|K,Tve,)| < cx) . (82) 

i=l 

Its trace, denoted tr T, is defined as 

oo 

tr T := ^|(u,,Tu,)| < oo . (83) 
j=i 

This sum can be shown to be independent of the orthonormal basis {ui}. In 
finite dimensions, this is a routine algebraic calculation, but an efficient proof 
for infinite dimensions requires careful organization. 

To define partial traces, it will be helpful to identify operators with sesquilin- 
ear forms: an operator T : H ^ H is identified with the sesquilinear form 
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Qt{ • , • ) defined for all u,v E H hy 

Qt{u,v) -.^ {u,Tv) . (84) 

When naming this form would only be a distraction, we shall refer to it as the 
form 

u,v^{u,Tv) . (85) 

It is well known (and easy to prove) that this provides a one-to-one correspon- 
dence between operators and bounded sesquilinear forms. This identification 
of operators with forms will allow us to read off properties of the partial trace 
from corresponding properties of the ordinary trace. 

Let S and M be Hilbert spaces, S^M their tensor product, and L : S®M — ^ 
S®M a. trace class operator on this tensor product. For given vectors s,s' E S, 
consider the form on M defined for all to, to' G Af by 

TO, to' I— > {s (E)m, L{s' (g) to')) . (86) 

For fixed s, s', this form corresponds to an operator on M which is easily seen to 
be trace class (because L is trace class on S* (g) M). Its trace is the definition of 
the partial trace with respect to M of L, denoted trM L, as a sesquilinear form. 
More concretely, for any orthonormal basis {fi} for M, 

(s,(trMi)s') =^(s®/„,L(s'0/„)) . (87) 

a 

(We sometimes distinguish dummy indices of summation by Greek.) From the 
assumption that L is trace class, it follows easily that the sum in the definition 
(|87|) converges absolutely, and also that tr j\/ L is trace class and tr (tr m L) = 
tr^Efl 

The definition ([87|l can also be viewed more concretely as defining tr^/ L as 
a matrix. If {e^} is an orthonormal basis for S*, and if the matrix of L with 
respect to the orthonormal basis {cj® fk] for S®M is {Lujk), then the matrix 
for tiM L is ^ia.ja)- An advantage of giving the definition in terms of 
forms instead of matrices is that it avoids the nuisance of checking that it is 
basis- independent . 

Let {fi} be an orthonormal basis for M. Then any unit vector u € S" (g) M 
can be written uniquely as 

u = ^s„(g/„ with^|s„p = l . (88) 

a a 

Recall that P„ (the projector onto u) represents the mixed state (often called 
a density matrix on S" (g Af) corresponding to the vector u. We shall derive a 
revealing formula for the mixed state tr^/ Pu- 

^■^ To see this, note that for normahzed s and s' , the sequences {s ^ fi} and {s' (gi fj} are 
orthonormal, so the definition of "L is trace class on S M" applies directly to assure the 
absolute convergence of the sum in II87I I. The other facts follow from similar observations. 
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Let {ci} be an orthonormal basis for S. First note that for any unit vector 
V € S, the matrix of the projector P„ on v is obtained from 

{ej.Pyek) = {ej,{v,ek)v) = (w, efe) (e^, u). (89) 

From the definition (I87p . for any fixed j, k 

(ej, (trAf F„)efc) = ^(ej (g) /„, P„(efe (g) /„)) 

a 

OL 

a 

= ^|s„p(e„P,„efc) (90) 

a 

In summary, 

trM P„ = trM = l^"l'^^o ■ (91) 

a 

(Recall that we are using the convention that Pg = Ps/|s|-) This exhibits trj\/ P^ 
as a convex linear combination of pure states Pg^ ■ (This makes it almost obvious 
that tr7\/ Pu is pure if and only u is a product state, a fact mentioned in the 
main text.) 

It is also of interest to calculate trg P„ for the u — J2a ^ fa given by ((88l) . 
with |m| = 1. Let {a} and {fj} be orthonormal bases for S and M, respectively. 
Then the matrix of trs Pu with respect to {fj} is given by 

ifk, (trs Pu)fj) = XI ® ^ fo)) 

a 

= ^{P^a ® fk, 3/3 (g) ff3, Ca (g /,) X (g J^.) 

a /3 7 

o 

= , (92) 

where the last equality is Parseval's equality. 

The statement and proof of equation ([92]) assumed for simphcity that |u| = 1, 
but of course the case of arbitrary nonzero u can be immediately obtained 
by normalization. Since this result will be needed in Appendix 2 , we state 
it explicitly for the reader's convenience. Let {fi} be an orthonormal basis 
for M. Then for any nonzero vector u = J2a ® fa G S' (g M, the matrix 
{{fk, (trs P«)/j)) of trs Pu is given by 

for u = EaSc.<g/a, (/fe , (tr^ P„)/j ) = . (93) 
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14 Appendix 2: Issues in proving weakness of 
the AAV-type "weak measurement" 

Subsection 7.1 outlined a weak measurement protocol and gave a hand- waving 
motivation (equation ()31|) ) why it it might be hoped to be weak in the sense of 
Definition [TJ When examined in detail, the "weakness" issue turns out to be 
unexpectedly subtle. 

So far as I know, this issue has never been recognized in the physics literature. 
This subsection discusses it in detail. 

We shall be discussing the situation discussed in the main text in which the 
state space S of primary interest is finite dimensional, but the meter space M = 
L^(R) is infinite dimensional. We switch to the notation 1 1 to 1 1 :— [J^ \m{q)\'^ dqY^^ 
for the Hilbert space norm of to S L^(R), in order to reserve |to((7)| for the 
absolute value of the function to(-) £ L^(R). We continue to use \s\ :— (s, s)^/^ 
and |u| for the norms of vectors s G S and u G S M. 

The notation will be as in the main text, summarized here for the reader's 
convenience. Let s ® to be a given product state in S*® M with |s| = 1 = ||m||, 
which will be fixed throughout the discussion. Let H{e) := eA (g) P, where A is 
a given Hermitian operator on S, P the momentum operator on M = L^(R), 
and V a unitary operator on S with Vs = s. 

Recall that we apply {V ® /)e~*^^^^ to s (8) to to obtain a state {V (8) 
/)e~'^^'^-' (s ® m) in which the expectation 

{{V /)e-*-f^(") (s (gi to), (/ ®Q){V® I)e-''"'^'^ (s ® to)) (94) 

is to be measured, where Q operates on 5 G i^(R) by {Qg){q) ■= qg{q)- The 
hope is that this expectation divided by e will approximate {s,As) for small e, 
and that the measurement will negligibly change the starting state s of S. 

At first glance, it seems plausible that for small e, the state (T^(g)/)e~*^('^' {s(S) 
to) would be close to s (8) m, and that the corresponding states of S, namely 
trM {V (g) /)(e~*-^('^)(s (g) to)) and tii^i (s (g to) = s would also be close. But the 
reader who attempts to rigorously justify this expectation will find that careful 
thought is necessary. One complication is that the operation tr^/ operates on 
an infinite dimensional space S (g) M , so that the sense in which it is continuous 
has to be carefully consideredEl But one welcome simplification is that the 
definition of "closeness" in S (or in density matrices on S) is not in issue: all 
norms on a finite dimensional space are equivalent, so we may define "closeness" 
in S by any convenient norm, and refiect this in language which would otherwise 
be sloppy. 

A priori, it is not enough to show that trj\/ P(y^i)e-'H(e) i^s^m) ^ Ps sls e ^ 
because of the possibility that the measurement of / (g) Q may affect the (mixed) 
state trAf P(yg,j)g-,H(.) |-^^,„) of S. 

For example, for infinite dimensional Af, the mapping u(X>v — > trj\/ Puigiv is not continuous 
with respect to the norm topologies, so e~''^^'^\s<S^m) s^m as e — > does not immediately 
imply that trjvf P^-iH{e)(a»m) — » trj^f Ps®m = Ps- 



45 



Since I ®Q has continuous spectrum, the precise manner in which the mea- 
surement of Q affects the state is not even defined by usual formulations of 
quantum mechanics 1^ 

It would be defined if Q had pure point spectrum, so to carry out a rigorous 
analysis we shall need to approximate Q by an operator with pure point spec- 
trum. To approximate Q in norm within A > by such an operator U, define 
B for g e L^CR) by 

{Bg)iq):=X[q/X]giq) , (95) 

where the bracket denotes the greatest integer function: for x e R, [x] denotes 
the greatest integer less than or equal to x. The function Bg need not be in 
L^(R), but restricting to the set of all g for which Bg is in L^(R) makes B 
Hermitian (we omit the details). 

Then B has the set {fcA}^_j^ of integer multiples of A as pure point spec- 
trum. For each integer k, let Pk denote the projector on the eigenspace of B 
with eigenvalue kX. This eigenspace consists of all functions with support in 
[fcA, (fc + 1)A). 

A measurement of B is the same as a projective measurement with respect 
to the resolution of the identity {J® Pk}'^_a^- If the composite system is in 
normalized pure state r € S ® M before a measurement of B, then after the 
measurement, it will be in (unnormalized) pure state {I®Pk)f' with probability 
|(/ ® Pfc)rp, i.e., in mixed state Yl,k ® Pk)r\'^ P{i^p^,)r- The state of S will 
then be 

5]lU®^fe)^l'trAfP(/®P,)r • (96) 

k 

Let {flj}"^]^ be an orthonormal basis for S of eigenvectors aj of A: Aaj = 
OijO-j- For our given s £ S", let s = '^j^j be its expansion as a linear combi- 
nation of these eigenvectors. We are interested in the case 

r := (y (g) /)e~*-^'^' (s (g) m) = ^ cfjVaj ® m^a, = ^ Vaj (g) ajuiea^ , (97) 

j j 

where mp denotes the translate of m by /3: m[}{q) :~ m{q — j3) for all (7 G R. 
By formula of Appendix 1 (with the roles of 5* and M reversed), the i,j 
matrix element of trjv/ P(^i(^p^y with respect to the orthonormal basis {l^a^} is: 

(t/flj, trM P(^itg,p^)rVaj) 

_ (Pkmea,,Pkm^a,)cr*<7i 

\{I<S>Pk)r\^ ■ ^ ' 

Substituting in ((96)) gives the state of S after the projective measurement, ex- 
pressed as a density matrix with respect to the orthonormal basis {Vaj}, as 

^(Pkmea,,Pkmea,)crj(Ji . (99) 

fc 

^^If one tries to substitute "Dirac delta functions" for eigenvectors of Q one is led outside 
the Hilbert space L-^(R). For example, if one measures Q and obtains the value 3.2, one 
cannot say that the subsequent state of the system is g h-> 5{q — 3.2). So far as the author 
knows, this problem has never been resolved in a complete and rigorous way. 
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Our goal is to show that the hmit as e of expression (|99p is Pg (which 
is s expressed as a density matrix). This is immediate if it is legitimate to 
interchange this limit with the summation in ([M]) because 

k 

= ^{Pkm,Pkm)a*ai 

k 
k 

since {Pk} is a resolution of the identity and ||to|| = 1. The operator on S 
whose matrix with respect to the basis {T^flj} is given by (jlOOp as {cjCn) is the 
projector onto the vector 

" ^ ^ '^^ "^ =Vs^s . 

3 3 

Thus we have established weakness of the measurement of B if the meter state 
m satisfies 

limy^(PfcmeQ PfcmeaJ = lim(PfcmeQ Pfcrn^aJ . (101) 

k k 

It is legitimate to interchange limits in an expression like (jlOip if all of the 
indicated limits exist and if at least one of the inner limits is uniformly A 
simple hypothesis which assures this is that m have compact support. In that 
case, the sum on the left side of (|10ip is actually finite with a maximum number 
of nonzero terms independent of e for small e, so the situation is trivial. Since 
physical "meters" have a bounded range of possible readings, this hypothesis is 
physically reasonable. 

Of course, many other conditions will also sufhce. For those who like to work 
with Gaussians, if m is a Gaussian (or more generally, if m{q) decays sufficiently 
rapidly as g — > ±cx)), uniform convergence of the sum follows from brute force 
upper bounds on |to((j')| for large q. 

As for existence of the limits, the limits in the right-hand expression in 
(|101|) obviously exist. If the left-hand limits are in question, one could pass 
to a sequence {eij^i with e; ^ for which limz^oo ' 
does exist. (Routine compactness arguments assure the existence of such a 
subsequence.) Then the uniformity shows that all the double limits are equal 
(and that the left-hand limit in (jlOip actually does exist). 
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